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TITLE: POLYPEPTIDES OF ALICYCLOBACILLUS SP. Modtaget 
FIELD OF THE INVENTION 

The present invention relates to functional polypeptides encoded by polynucleotides comprised 
in the genome of Alicyclobacillus sp. deposited under deposit accession number DSM 15716. 
5 The invention relates further to the polynucleotides and constructs of such polynucleotides en- 
coding such polypeptides or facilitating their expression as well as to method for preparing the 
polypeptide. Still further the invention relates to compositions comprising the polypeptide and 
to uses of the polypeptide. 

BACKGROUND OF THE INVENTION 

10 Some enzymes from the genus of Alicyclobacillus species are known such as described in 
Matzke et al.; Gene cloning, nucleotide sequence and biochemical properties of a cytoplasmic 
cyclomaltodextrinase (neopullulanase) from Alicyclobacillus acidocaldahus ATCC 2700; re- 
classification of a group of enzymes; Submitted (MAR-1999) to the EMBL/GenBank/DDBJ da- 
tabases or Koivula et al.; Cloning and sequencing of a gene encoding acidophilic amylase from 

1 5 Bacillus a cidocaldarius. J. G en. M icrobiol. 1 39:2399 { 1 993) o r B artolucci e t a L; Thioredoxin 
from Bacillus acidocaldahus: characterization, high-level expression in Escherichia coli and 
molecular modeling: Biochem. J. 328:277 (1997) or Tsuruoka et al.; Collagenolytic Serine- 
Carboxyl Proteinase from Alicyclobacillus sendainensis Strain NTAP-1: Purification, Charac- 
terization, Gene Cloning, and Heterologous Expression; Submitted (MAY-2002) to the 

20 EMBL/GenBank/DDBJ databases; Eckert K. & Schneider E., A thermoacidophilic endogluca- 
nase (celB) from Alicyclobacillus acidocaldahus displys high sequence similarity to arabinofu- 
ranosidases belonging to famili 51 of glycosyl hydrolases', Eur. J. Biochem., 270: 3593-3602, 
2003. 

In the pursuit of novel enzymes it is also known to screen for such new enzymes by 
25 subjecting potential candidates to specific enzyme assays. This approach is limited to the 
availability of enzyme assays and does not allow the identification of functional enzymes or 
polypeptides for which the activity is still unknown. 

Further, whole genome sequencing is a known method to obtain the information on all 
genes from a given microorganism e.g. as described in Fleischmann et al.; Whole genome se- 
30 quences and assembly of Haemophilus influenzae Rd\ Nature 269: 496- 512; (1995). 

Most enzymes for industrial use are enzymes which are secreted to the medium by a 
microorganism. However, only a few percent of a microorganisms' genome encodes secreted 
proteins. For example only approx. 4% of the Bacillus subtilis genome or its closest relatives 
encode secreted proteins (Van Dijl et al.: Protein transport pathways in Bacillus subtilis: a ge- 
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nome-based road map] in "Basillus subtilis and it's closest relatives" - From genes to celts: 
p.337-355; A. L Sonenshein (ed.); ASM Press 2002). 

One disadvantage of genome sequencing is that the vast majority of the obtained se- 
quences encode non secreted proteins. 
5 Also known is signal trapping which is a method to identify genes including nucleotides 

encoding a signal peptide using a translational fusion to an extracellular reporter gene lacking 
its own signal (WO 01/77315). 

SUMMARY OF THE INVENTION 

The present inventors have found a strain of Alicyclobacillus namely Alicyclobacillus sp. 

10 DSM 1 5716 which grows at low pH (approx 4-5) a nd at high temperature (50-60 ° C). This 
strain is interesting because the phylogenetic distance between the public known strains and 
strain DSM 15716 is significant and because the growth conditions are similar to conditions for 
several applications for industrial enzymes. 

This strain was subjected to signal trapping in accordance with WO 01/77315 and 

15 those clones in a gene library of Alicyclobacillus sp. DSM 15716 t that encode secreted or sur- 
face-displayed polypeptides or those that contain surface displayed domains was identified 
without having to redone a library in a screening-vector and without having to screen the li- 
brary in traditional labour- and time consuming activity assays. Thereby rapid determination 
and efficient industrial exploitation of those genes of Alicyclobacillus sp. DSM 15716 encoding 

20 functional secreted polypeptides and/or proteins can be made. 

The genome of a microorganism contains thousands of different genes; some encoding 
polypeptides some coding for RNAs. Only a limited number of the genes in the genome of a 
microorganism encode functional polypeptides which are secreted by the microorganism to the 
surrounding medium serving an external purpose for the microorganism. Such polypeptides 

25 are interesting for industry from the point of view that such polypeptides may be produced in 
considerable amounts in continuous processes without destroying the cells producing the 
polypeptides. 

It is an object of the present invention to identify and provide polypeptides secreted 
from Alicyciobacilius sp. deposited under deposit accession number DSM 15716 and thus 
30 serves a real functional purpose for the Alicyclobacillus sp. because such polypeptides may 
not only be used for a industrial purposes; they may also be produced in industrially relevant 
processes and amounts. 

The present invention thus provides in a first aspect an isolated polypeptide selected 
from the group consisting of: 
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(a) a polypeptide comprising an amino acid sequence which has at least 90% identity with 
a sequence of a mature polypeptide comprised in the group of SEQ ID NO: 26 to SEQ 
ID NO:50; 

(b) a polypeptide which is encoded by a nucleotide sequence which hybridize under high 
5 stringency conditions with a polynucleotide probe selected from the group consisting of 

(i) the complementary strand to a nucleotide sequence selected from the group of 
regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide. 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from the group of regions of SEQ ID NO: 1 to SEQ ID NO: 25 

10 encoding a mature polypeptide 

(c) a fragment of a mature polypeptide comprised in SEQ ID NO:26 to SEQ ID NO:50 
and 

15 wherein the polypeptide has a function of the corresponding mature polypeptides comprised in 
SEQ ID NO:26 to SEQ ID NO:50. 

In a further aspect the invention provides an isolated enzyme selected from the group consist- 
ing of: * 

20 (a) an enzyme comprising an amino acid sequence which has at least 90% identity with 
the amino acid sequence of a mature enzyme selected from the group consisting of 
acid endoglucanase or acid celluiase, aspartyl protease, multi copper oxidase, serine- 
carboxyl protease, serine protease or HtrA-like serine protease, disulfide isomerase, 
gamma-D-glutamyl-L-diamino acid endopeptidase, endo-beta-N- 

25 acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or phytase or 

phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase 
secreted from t he strain of A licyclobacillus sp. Deposited u nder D SM a ccession N o. 
15716 

30 (b) a polypeptide which is encoded by a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 
(i) the complementary strand to a nucleotide sequence comprised in the strain of 
Aficyctobacillus sp. Deposited under DSM accession No. 15716 encoding a ma- 
ture enzyme selected from the group consisting of acid endoglucanase or acid 
35 cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl protease, 

serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 
glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, 

3 
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peptidyl-prolyl-isomerase, acid phosphatase or phytase or phospholipase C, 
polysaccharide deacetylase or xylan deacetylase and sulfite oxidase secreted 
from that strain; 

the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences comprised in the strain of Alicyclobacillus sp. Deposited under DSM 
accession No. 15716 encoding a mature enzyme selected from the group con- 
sisting of acid endoglucanase or acid cellulase, aspartyl protease, multi copper 
oxidase, serine-carboxyl protease, serine protease or HtrA-like serine protease, 
disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or 
phytase or phospholipase C, polysaccharide deacetylase or xylan deacetylase 
and sulfite oxidase secreted from that strain 

a fragment of a mature enzyme selected from the group consisting of acid endogluca- 
nase or acid cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl prote- 
ase, serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 
glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, peptidyl- 
prolyl-isomerase, acid phosphatase or phytase or phospholipase C, polysaccharide 
deacetylase or xylan deacetylase and sulfite oxidase secreted from the strain of Alicy- 
clobacillus sp. Deposited under DSM accession No. 15716, 

wherein the enzyme have a function selected from acid endoglucanase or acid celJulase, as- 
partyl protease, multi copper oxidase, serine-carboxyl protease, serine protease or HtrA-like 
25 serine protease, disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or phytase or 
phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase 

In further aspects the invention provides a polynucleotide encoding the polypeptide of 
30 the invention; a nucleotide construct comprising the polynucleotide encoding the polypeptide, 
operably linked to one or more control sequences that direct the production of the polypeptide 
in a host cell; a recombinant expression vector comprising the nucleotide construct of the in- 
vention and to a recombinant host cell comprising the nucleotide construct of the invention. 

In still further aspects the invention provides a method of preparing a polypeptide of the 
35 invention comprising: 

(a) cultivating a strain comprising a nucleotide sequence encoding a polypeptide of the in- 
vention which strain is capable of expressing and secreting the polypeptide and 

4 
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(b) recovering the polypeptide. 

In still further aspects the invention provide a composition comprising a polypeptide of 
the invention and a method for preparing such a composition comprising admixing the poly- 
5 peptide of the invention with an excipient. 

In still further aspects the invention provides use of the polypeptide of the invention or a 
composition comprising said polypeptide in various applications. 

BRIEF DESCRIPTION OF DRAWINGS 

No drawings. 

10 SEQUENCE LISTING 

The present application contains information in the form of a sequence listing, which is ap- 
pended to the application and also submitted on a data carrier accompanying this application. 
The contents of the data carrier are fully incorporated herein by reference. The regions of SEQ 
ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide encodes the mature polypeptides 
15 of SEQ ID NO:26 to SEQ ID NO:50. The region of SEQ ID NO: 1 encoding a mature polypep- 
tide thus encodes the mature polypeptide sequence comprised in SEQ ID NO:26, the region of 
SEQ ID NO:2 encoding a mature polypeptide encodes the mature polypeptide comprised in 
SEQ ID NO:27 and so on. 

20 DETAILED DESCRIPTION OF THE INVENTION 
Definitions 

The term "identity" as used herein, is to be understood as the homology between two 
amino acid sequences or between two nucleotide sequences. For purposes of the present in- 

25 vention, the degree of identity between two amino acid sequences was determined by using 
AlignX in the program of Vector NTI ver. 7.1 (Informax inc., 7600 Wisconsin Avenue, Suite 
#1100, Bethesda, MD 20814, USA). Amino acid alignment was created using the Clustal W 
algorithm (Nucleic Acid Research, 22 (22): 4673-4680, 1994). The following additional parame- 
ters are used: Gap opening penalty of 10, Gap extension penalty of 0.05, Gap separation pen- 

30 alty range of 8. Pairwise alignment parameters were Ktuple = 1, gap penalty = 3, gap length 
opening penalty = 10, gap extension penalty - 0.1, window size = 5 and diagonals = 5. The 
degree of identity between two nucleotide sequences is determined using the same algorithm 
and software package as described above for example with the following settings: Gap penalty 
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of 10, and gap length penalty of 10. Pairwise alignment parameters is Ktuple=3, gap pen- 
alty=3 and windows=20. 

The term "functional polypeptide" as used herein in the context of the present invention 
5 means a polypeptide which can be expressed and secreted by a cell and which constitutes an 
operational unit capable of operating in accordance with the function it is designed to fullfill by 
the cell. Optionally, co-factors may be required for the polypeptide to adopt the intended func- 
tion. One example of functional polypeptides is catalytically active polypeptides or enzymes 
which help the cell catalyzing reactions in the environment surrounding the cell. Another ex- 
10 ample could be polypeptides which serve as signal substance. Further examples are polypep- 
tides which function as sensors (receptors) for environmental parameters (chemicals in the en- 
vironment surrounding the cell) or polypeptides, which are active against other organisms (an- 
timicrobial (poly)peptides) or polypeptides, which contributes to the structural integrity of the 
cell. 

15 

The term "mature region" as used herein about portion of an amino acid sequences or poly- 
peptide means the portion or region or domain or section of the amino acid sequences or poly- 
peptide which is the mature functional polypeptde. 

20 The term "region of nucleotide sequence encoding a mature polypeptide" as used herein 
means the region of a nucleotide sequence counting from the triplet encoding the first amino 
acid of a mature polypeptide to the last triplet encoding the last amino acid of a mature poly- 
peptide. 

25 Polypeptides of the invention 

The polypeptides of the invention are all polypeptides secreted by Aficyclobaciltus sp. DSM 
15716 with the purpose of serving a function for that particular cell. 

Employing the method disclosed in WO 01/77315 on the genome of Alicyclobacillus sp. 
strain as deposited it was surprisingly found that among the thousands of potential genes, the 

30 polynucleotides of this genome encoded 25 secreted functional mature polypeptides com- 
prised in SEQ ID NO: 26 to SEQ ID NO:50. The method of WO 01/77315 is for purposes of the 
present invention herein denoted or referred to as the Transposon Assisted Signal Trapping 
method or the TAST method. 

In view of the TAST methodology, which is further discussed below it could be con- 

35 eluded that these 25 polypeptides were indeed functional polypeptides synthesised and se- 
creted b y t he A licyclobacillus sp. strain f or a p articular p urpose. This finding s eparated the 
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polypeptides and polynucleotides of the present invention e.g. from random fragments of poly- 
peptides or polynucleotides. 

The genes identified by the TAST procedure are functional genes, which are translated 
into functional polypeptides by the chosen host cell. The rationale behind this methology is to 
5 fuse all genes within a selected genome with a gene encoding a signalless beta-lactamase via 
a transposon tag; Hence when growing host cell clones comprising the genes of a genome 
fused with a gene encoding a signalless beta-lactamase via a transposon tag in an arnpicillin 
containg medium only those clones expressing and secreting a beta-lactamase will survive. 
However the beta-lactamase will only be secreted if the gene to which the beta-lactamase 

10 gene is fused has an intact promoter and ribosome binding site (i.e. a gene which is expressed 
by the cell to produce a polypeptide in real life), which can be recognized in the host strain, 
and if the beta-lactamase is translated so that the synthesized polypeptide is transported 
across the cytoplasma membrane and folded correctly. Hence, when inserting the fused gene 
into a selected host cell, those clones, which are arnpicillin resistant contains a gene which en- 

15 codes a functional secreted polypeptide. 

Usually, when employing the TAST methodology it is even not necessary to express 
the entire gene. When tagging the genes with a transposon, expression of the N-terminal part 
of the genes as protein fusion shows that the genes contain intact transcription, translation and 
secretion sequences. Hence expression of the N-terminal part of the genes as protein fusion is 

20 usually regarded as sufficient for assuring expression and secretion of the entire gene. 

Thus it can be concluded that the genes obtained by the TAST method actually do en- 
code secreted functional polypeptides. For the 25 genes it was shown by sequence analysis, 
that in-frame fusions between with the signalless beta-lactamase gene were obtained. In addi- 
tion, the intactness of the genes' open reading frame was confirmed by determining the entire 

25 nucleotide sequence. 

By comparing homology or identity of the sequences of the 25 polypeptide sequences 
to known sequences the particular function of the polypeptides were annotated. Thus it was 
found that of the 25 secreted functional polypeptides, 15 could surprisingly be showed to be 
enzymes, while 10 polypeptides had an enzyme or other function 

30 It was indeed an unforeseeable discovery that Alicyclobacillus sp. DSM 15716 should 

express and secrete the functional mature polypeptides comprised in SEQ ID NO: 26 to SEQ 
NO: 50 and that, in the genome of that particular strain, the regions of SEQ ID NO: 1 to SEQ 
ID NO: 25 encoding a mature polypeptide would be the genes encoding the mature polypep- 
tides comprised in SEQ ID NO: 26 to SEQ NO: 50. Further in a particular embodiment the 

35 genes encoding the mature polypeptides comprised in of SEQ ID NO:26 to SEQ NO: 50 can 
all be expressed and their corresponding mature polypeptides can be secreted when culturing 
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an E. coli host transformed with polynucleotides comprising those regions of SEQ ID NO: 1 to 
SEQ ID NO: 25 encoding a mature polypeptide. 

It was indeed surprising that the mature polypeptides comprised in SEQ ID NO: 26 to 
SEQ ID NO: 50 in particular shoufd be revealed as polypeptides secreted from the Alicycloba- 
5 cillus sp. strain, because these properties could not be derived from studying the genome as 
such. 

Hence the invention provides an isolated polypeptide selected from the group consist- 
ing of: 

(a) a polypeptide having an amino acid sequence which has at least 90% identity with an 
10 amino acid sequence selected from the group consisting of the mature polypeptides 

comprised in SEQ ID NO: 26 to SEQ ID NO:50; 



(b) a polypeptide which is encoded by a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 
15 (i) the complementary strand to a n ucleotide sequence selected from the group 

consisting of regions of SEQ JD NO: 1 to SEQ ID NO: 25 encoding a mature 
polypeptide, 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a 
20 mature polypeptide and 



(c) a fragment of mature polypeptides of SEQ ID NO:26 to SEQ ID NO:50, 

and wherein the polypeptide exhibits the function of the corresponding mature polypeptide of 

SEQ ID NO: 26 to SEQ ID NO:50. 

25 

In one particular embodiment the polypeptide of the invention is selected among the 
enzymes secreted by Alicyclobacillus sp. deposited under DSM accession No. 15716 and iso- 
lated by the present inventors, i.e. the group of enzymes consisting of acid endoglucanase, 
acid cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl protease, serine prote- 
30 ase, HtrA-like serine protease, disulfide isomerase, gamma-D-glutamyl-L-diamino acid 
endopeptidase, endo-beta-N-acetylglucosamtnidase, peptidyl-prolyl-isomerase, acid phos- 
phatase, phytase, phospholipase C, polysaccharide deacetylase, xylan deacetylase and sulfite 
oxidase. 



35 Hence, the invention also provides an isolated enzyme selected from the group consisting of: 

(a) an enzyme comprising an amino acid sequence which has at least 90% identity with 

the amino acid sequence of a mature enzyme selected from the group consisting of 

8 



acid endoglucanase or acid cellulase, aspartyl protease, multi copper oxidase, serine- 
carboxyl protease, s ertne p rotease or HtrA-like serine protease, d isulfide isomerase, 
gamma-D-glutamyl-L-diamino acid endopeptidase, endo-beta-N- 

acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or phytase or 
phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase 
secreted from the strain of A licyclobacillus sp. Deposited under DSM accession No. 
15716 

a polypeptide which is encoded by a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 

(i) the complementary strand to a nucleotide sequence comprised in the strain of 
Aficyclobacillus sp. Deposited under DSM accession No. 15716 encoding a ma- 
ture enzyme selected from the group consisting of acid endoglucanase or acid 
cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl protease, 
serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 
glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, 
peptidyl-prolyl-isomerase, acid phosphatase or phytase or phospholipase C, 
polysaccharide deacetylase or xylan deacetylase and sulfite oxidase secreted 
from that strain; 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences comprised in the strain of Aficyclobacillus sp. Deposited under DSM 
accession No. 15716 encoding a mature enzyme selected from the group con- 
sisting of acid endoglucanase or acid cellulase, aspartyl protease, multi copper 
oxidase, serine-carboxyl protease, serine protease or HtrA-like serine protease, 
disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or 
phytase or phospholipase C, polysaccharide deacetylase or xylan deacetylase 
and sulfite oxidase secreted from that strain 

a fragment of a mature enzyme selected from the group consisting of acid endogluca- 
nase or acid cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl prote- 
ase, serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 
glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, peptidyl- 
prolyl-isomerase, acid phosphatase or phytase or phospholipase C, polysaccharide 
deacetylase or xylan deacetylase and sulfite oxidase secreted from the strain of Alicy- 
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clobacilfus sp. Deposited under DSM accession No. 15716, 

wherein the enzyme have a function selected from acid endoglucanase or acid cellulase, as- 
partyl protease, multi copper oxidase, serine-carboxyl protease, serine protease or HtrA-like 
5 serine protease, disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or phytase or 
phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase 

In a particular embodiment the enzyme is an isolated enzyme selected from the group 
10 consisting of: 

(a) an enzyme having an amino acid sequence which has at least 90% identity with an 
amino acid sequence selected from mature enzymes comprised in SEQ ID NO: 26 to 
SEQ ID NO:40; 

(b) an enzyme which is encoded by a nucleotide sequence which hybridize under high 
15 stringency conditions with a polynucleotide probe selected from the group consisting of 

(i) the complementary strand to a nucleotide sequence selected from the group of 
regions of SEQ ID NO: 1 to SEQ ID NO: 15 encoding the mature enzyme, 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from regions of SEQ ID NO: 1 to SEQ ID NO: 15 encoding 

20 the mature polypeptide 

(c) a fragment of the mature enzyme comprised in SEQ ID NO:26 to SEQ ID NO:40, and 

wherein the enzyme has a function of the corresponding mature polypeptides comprised in 
SEQ ID NO:26 to SEQ ID NO:40 

25 

The polypeptide of the invention is an isolated polypeptide meaning in particular that a 
preparation of the polypeptide of the invention contains at the most 90% by weight of other 
polypeptide material with which it may be natively associated (lower percentages of other 
polypeptide material are preferred, e.g. at the most 80% by weight, at the most 60% by weight, 

30 at the most 50% by weight, at the most 40% at the most 30% by weight, at the most 20% by 
weight, at the most 10% by weight, at the most 9% by weight ,at the most 8% by weight, at the 
most 6% by weight, at the most 5% by weight, at the most 4% at the most 3% by weight, at the 
most 2% by weight, at the most 1% by weight and at the most 7 2 % by weight). Thus, it is pre- 
ferred that the isolated polypeptide of the invention is at least 92% pure, i.e. that the polypep- 

35 tide of the invention constitutes at least 92% by weight of the total polypeptide material present 

in the preparation, and higher percentages are preferred such as at least 94% pure, at least 

95% pure, at least 96% pure, at least 96% pure, at least 97% pure, at least 98% pure, at least 

10 
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99%, and at the most 99.5% pure. In particular, it is preferred that the polypeptide of the inven- 
tion is in "essentially pure form 1 ', i.e. that the polypeptide preparation is essentially free of other 
polypeptide material with which It Is natively associated. This can be accomplished, for exam- 
ple, by preparing the polypeptide of the invention by means of well-known recombinant rneth- 
5 ods. 

The polypeptide of the invention of the invention may be synthetically made, naturally 
occurring or a combination thereof. In a particular embodiment the polypeptide of the invention 
may be obtained from a microorganism such as a prokaryotic cell, an archaeal cell or a eu- 
karyotic cell. The cell may further have been modified by genetic engineering 
10 In a particular embodiment, the polypeptide of the invention is an enzyme exhibiting op- 

timum enzyme activity at a temperature within the range from about 10°C to about 80 d C, par- 
ticularly in the range from about 20°C to about 60°C. 

In a particular embodiment the polypeptide of the invention is an enzyme, which is 
functionally stabile at a temperature of up to 100 °C, in particular up to 80 °C, more particularly 
15 upto60°C. 

In a particular embodiment the polypeptide of the invention is an enzyme exhibiting at 
least 20%, in particular at least 40%, such as at least 50%, in particular at least 60%, such as 
at least 70%, more particularly at least 80%, such as at feast 90%, most particularly at least 
95%, such as about or at least 100% of the enzyme activity of an enzyme selected from ma- 

20 ture enzymes comprised in SEQ ID NO: 26 to SEQ ID NO: 50. 

In a particular embodiment the polypeptide of the invention comprises, contains or con- 
sists of an amino acid sequence which has at least 90% identity with a polypeptide sequence 
selected from the group consisting of mature polypeptides comprised in SEQ ID NO: 26 to 
SEQ ID NO: 50; particularly at least 95%, e.g. at least 96%, such as at least 97%, and even 

25 more particularly at least 98%, such as at least 99% or even 100% identity. 

In another particular embodiment the polypeptide of the invention comprises, contains 
or consists of an amino acid sequence, which has at least 50% identity with a polypeptide se- 
quence selected from the group consisting of mature polypeptides comprised in SEQ ID NO: 
26 to SEQ ID NO: 50; particularly at least 60%, particularly at least 65%, particularly at least 

30 70%, particularly at least 75%, particularly at least 80%, and even more particularly at least 
85% identity. 

In a particular embodiment, the amino acid sequence of the polypeptide of the inven- 
tion differs by at the most ten amino acids (e.g. by ten amino acids), in particular by at the most 
five amino acids (e.g. by five amino acids), such as by at the most four amino acids (e.g. by 
35 four amino acids), e.g. by at the most three amino acids (e.g. by three amino acids), in particu- 
lar by at the most two amino acids (e.g. by two amino acids), such as by one amino acid from 
the mature polypeptides comprised in SEQ ID NO: 26 to SEQ ID NO: 50. 
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The polypeptide of the invention may be a wild-type polypeptide isolated from a natural 
source such as the strain AlicyctobaciHus sp. DSM 15716 or another wild type strain, however 
the present invention also encompass artificial variants, where a polypeptide of the invention 
has been mutated for example by adding, substituting and/or deleting one or more amino acids 
5 from said polypeptide while retaining the function of the polypeptide and/or other properties. 
Hence, the polypeptide of the invention may be an artificial variant, wherein at least one substi- 
tution, deletion and/or insertion of an amino acid has been made to an amino acid sequence 
comprising or consisting of the mature polypeptide comprised in SEQ ID NO: 26 to SEQ ID 
NO: 50. 

10 Such artificial variants may be constructed by standard techniques known in the art 

usually followed by screening and/or characterization. Standard techniques includes classical 
mutagenesis, e.g. by UV irradiation of the cells or treatment of cells with chemical mutagens as 
described by Gerhardt et al. (1994); in vivo gene shuffling as described in WO 97/07205; in 
vitro shuffling as described by Stemmer, (1994) or WO 95/17413, random mutagenesis as de- 

15 scribed by Eisenstadt E. et al., (1994); PGR techniques as described by Poulsen et al. (1991); 
family shuffling as described by J.E. Ness, et al, Nature Biotechnology, vol. 17, pp. 893-896 
(1999); site-directed mutagenesis as described by Sambrook et al. (1989), Sambrook et al., 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. A general description of nu- 
cleotide substitution can be found in e.g. Ford et al, 1991, Protein Expression and Purification 

20 2. p. 95-107. 

Such standard genetic engineering methods may also be used prepare a diversified li- 
brary of variant nucleotide sequences from the genes encoding one or more parent enzymes 
of the invention, expressing the enzyme variants in a suitable host cell and selecting a pre- 
ferred variant(s). A diversified library can be established by a range of techniques known to the 

25 art (Reetz MT; Jaeger KE, in Biocatalysis - from Discovery to Application edited by Fessner 
WD, Vol. 200, pp. 31-57 (1999); Stemmer, Nature, vol. 370, p.389-391, 1994; Zhao and Ar- 
nold, Proc. Natl. Acad. Sci., USA, vol. 94, pp. 7997-8000, 1997; or Yano et al., Proc. Natl. 
Acad. ScL, USA, vol. 95, pp 5511-5515, 1998). 

In a particular embodiment of the invention, amino acid changes (in the artificial variant 

30 as well as in wild-type enzyme) are of a minor nature, that is conservative amino acid substitu- 
tions that do not significantly affect the folding and/or activity of the protein; small deletions, 
typically of one to about 30 amino acids; small amino- or carboxyl-terminal extensions, such as 
an amino-terminal methionine residue; a small linker peptide of up to about 20-25 residues; or 
a small extension that facilitates purification by changing net charge or another function, such 

35 as a poly-histidine tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids (ar- 
ginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino 
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acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine, valine and 
methionine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino 
acids (glycine, alanine, serine and threonine). Amino acid substitutions which do not generally 
alter and or impair the function of a protein are known in the art and are described, for exam- 
5 pie, by H. Neurath and R.L. Hill, 1979. In, The Proteins, Academic Press, New York. The most 
commonly occurring exchanges are Ala/Ser, Val/lle, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, 
Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ue, Leu/Val, Ala/Glu, and 
Asp/Gly as well as these in reverse. 

In a particular embodiment the amino acid changes are of such a nature that the phys- 

10 ico-chemical properties of the polypeptides are altered. For example, amino acid changes may 
be performed, which improve the thermal stability of the enzyme, which alter the substrate 
specificity, which changes the pH optimum, and the like. 

Particularly, the number of such substitutions, deletions and/or insertions in the poly- 
peptide of the invention, particularly in those polypeptides selected from the group consisting 

15 of mature polypeptides comprised in SEQ ID NO: 26 to SEQ JD NO: 50 to produce an artificial 
variant is at the most 10, such as at the most 9, e.g. at the most 8, more preferably at the most 
7, e.g. at the most 6, such as at the most 5, most preferably at the most 4, e.g. at the most 3, 
such as at the most 2, in particular at the most 1. 

In a particular embodiment the artificial variant is a variant, which has an altered, pref- 

20 erably reduced, immunogenicity, especially allergenicity, in animals including man as com- 
pared to a parent enzyme. The term "immunogenicity" in this context is to be understood as 
the artificial variant capability of invoking a an altered, in particular reduced, immunological re- 
sponse when administered to an animal, including intravenous, cutaneous, subcutaneous, oral 
and intratracheal administration. The term "immunological response" in this context means that 

25 the administration of the artificial variant causes an alteration in the immunoglobolin levels in 
the animal body, such as in IgE, IgG and IgM or an alteration in the cytokine level in the animal 
body. Methods for mapping immunogenic/antigenic epitopes of a protein, preparing variants 
with altered immunogenicity and methods for measuring an immunological response is well 
known to the art and are described e.g. in WO 92/10755, WO 00/26230, WO 00/26354 and 

30 WO 01/31989. The term "allergenicity" in this context is to be understood as the artificial vari- 
ant ability of invoking an altered, in particular reduced, production of IgE in an animal as well 
as the ability to bind IgE from said animal. Particularly allergenicity arising from intratracheal 
administration of the polypeptide variant to the animal is particularly of interest (also known as 
respiratory allergenicity). 

35 In a further embodiment, the polypeptide of the invention is a polypeptide which is en- 

coded by nucleotide sequences which hybridize under at least high stringency conditions, par- 
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ticularly under very high stringency conditions with a polynucleotide probe selected from the 
group consisting of 

(i) the complementary strand to a nucleotide sequence selected from the group of regions 
of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide, 
5 (ii) the complementary strand to the cDNA sequence contained in a nucleotide sequences 
selected from regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypep- 
tide 

(iii) a fragment of (i) or (ii) encoding a secreted polypeptide having the function of the cor- 
responding mature polypeptide comprised in SEQ ID NO: 26 to SEQ ID NO: 50 

10 

(J. Sambrook, E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual, 
2d edition, Cold Spring Harbor, New York). 

In particular, the polypeptide of the invention is encoded by a polynucleotide comprising 
a nucleotide sequence selected from the group of regions of SEQ ID NO: 1 to SEQ ID NO: 25 

15 encoding a mature polypeptide or a sequences differing therefrom by virtue of the degeneracy 
of the genetic code. More particularly, the polypeptide of the invention is encoded by a polynu- 
cleotide consisting of a nucleotide sequence selected from the group of regions of SEQ ID NO: 
1 to SEQ ID NO: 25 encoding a mature polypeptide or a sequence differing there from by vir- 
tue of the degeneracy of the genetic code. 

20 The nucleotide sequences of regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a 

mature polypeptide or a subsequence thereof, as well as the amino acid sequences of the ma- 
ture polypeptides comprised in SEQ ID NO: 26 to SEQ ID NO: 50 or a fragment thereof, may 
be used to design a polynucleotide probe to identify and clone DNA encoding enzymes of the 
invention from strains of different genera or species according to methods well known in the 

25 art. In particular, such probes can be used for hybridization with the genomic or cDNA of the 
genus or species of interest, following standard Southern blotting procedures, in order to iden- 
tify and isolate the corresponding gene therein. Such probes can be considerably shorter than 
the entire sequence, but should be at least 15, preferably at least 25, more preferably at least 
35 nucleotides in length, such as at least 70 nucleotides in length. It is; however, preferred that 

30 the polynucleotide probe is at least 100 nucleotides in length. For example, the polynucleotide 
probe may be at least 200 nucleotides in length, at least 300 nucleotides in length, at least 400 
nucleotides in length or at least 500 nucleotides in length. Even longer probes may be used, 
e.g., polynucleotide probes which are at least 600 nucleotides in length, at least 700 nucleo- 
tides in length, at least 800 nucleotides in length, or at least 900 nucleotides in length. Both 

35 DNA and RNA probes can be used. The probes are typically labelled for detecting the corre- 
sponding gene (for example, with 32 P, 3 H, 35 S, biotin, or avidin). 
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Thus, a genomic DNA or cDNA library prepared from such other organisms may be 
screened for DNA, which hybridizes with the probes described above and which encodes en- 
zymes of the invention. Genomic or other DNA from such other organisms may be separated 
by agarose or polyacrylamide gel electrophoresis, or other separation techniques. DNA from 
5 the libraries or the separated DNA may be transferred to, and immobilized, on nitrocellulose or 
other suitable carrier materials. In order to identify a clone or DNA which has the required ho- 
mology and/or identity or is homologous and/or identical with of nucleotides selected from re- 
gions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide, the carrier material 
with the immobilized DNA is used in a Southern blot. 

10 For purposes of the present invention, hybridization indicates that the nucleotide se- 

quence hybridizes to a labelled polynucleotide probe which again hybridizes to a nucleotide 
sequence selected from regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature poly- 
peptide under high to very high stringency conditions. Molecules to which the polynucleotide 
probe hybridizes under these conditions may be detected using X-ray film or by any other 

15 method known in the art. Whenever the term "polynucleotide probe" is used in the present con- 
text, it is to be understood that such a probe contains at least 15 nucleotides. 

In an interesting embodiment, the polynucleotide probe is the complementary strand of 
a nucleotide sequence selected from regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a 
mature polypeptide. 

20 In another interesting embodiment, the polynucleotide probe is the complementary 

strand of a nucleotide sequence which encodes an enzyme selected from the group of SEQ ID 
NO: 26 to SEQ ID NO: 50. In a further interesting embodiment, the polynucleotide probe is the 
complementary strand of a mature polypeptide coding region of a nucleotide sequence se- 
lected from regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide. 

25 For long probes of at least 100 nucleotides in length, high to very high stringency condi- 

tions are defined as prehybridization and hybridization at 42°C in 5X SSPE, 1.0% SDS, 5X 
Denhardt's solution, 100 microgram/ml sheared and denatured salmon sperm DNA. following 
standard Southern blotting procedures. Preferably, the long probes of at least 100 nucleotides 
do n ot c ontain m ore t han 1 000 n ucleotides. F or I ong p robes o f a 1 1 east 1 00 n ucleotides i n 

30 length, the carrier material is finally washed three times each for 15 minutes using 0.1 x SSC, 
0.1% SDS at 60°C {high stringency), in particular washed three times each for 15 minutes us- 
ing 0.1 x SSC, 0.1% SDS at 68°C (very high stringency). 

Although not particularly preferred, it is contemplated that shorter probes, e.g. probes 
which are from about 15 to 99 nucleotides in length, such as from about 15 to about 70 nucleo- 

35 tides in length, may be also be used. For such short probes, stringency conditions are defined 
as prehybridization, hybridization, and washing post-hybridization at 5°C to 10°C below the cal- 
culated Tm using the calculation according to Bolton and McCarthy (1962, Proceedings of the 
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the National Academy of Sciences USA 48:1390) in 0.9 M NaCI, 0.09 M Tris-HCI pH 7.6, 6 mM 
EDTA, 0.5% NP-40, 1X Denhardfs solution, 1 mM sodium pyrophosphate. 1 mM sodium 
monobasic phosphate, 0.1 mM ATP, and 0.2 mg of yeast RNA per ml following standard 
Southern blotting procedures. 
5 For short probes which are about 15 nucleotides to 99 nucleotides in length, the carrier 

material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 15 min- 
utes using 6X SSC at 5°C to 10°C below the calculated Tm. 

SEQ ID NO: 26 acid endoqlucanase or acid cellulase 

10 In a particular embodiment the polypeptide of the invention is an acid endoglucanase or acid 
cellulase comprising or consisting of an amino acid sequence which has at least 90%, particu- 
larly at least 95%, more particularly at least 96%, more particularly at least 97%, more particu- 
larly at least 98%, more particularly at least 99% or most particularly 100% identity with an acid 
endoglucanase or acid cellulase obtainable from Alicyclobaccillus sp., in particular that strain 

15 of AficyclobaciHus sp. Deposited under DSM accession No. 15716, more particularly the ma- 
ture acid endoglucanase or add cellulase comprised in SEQ ID NO: 26. More specifically the 
mature acid endoglucanase or acid cellulase comprise or consists of the sequences from posi- 
tion 25 to 959 of SEQ ID NO: 26. In the present context an acid endoglucanase is defined as 
enzyme, which endohydrolyzes 1 ,4-beta-D-glucosidic linkages in cellulose, lichenin or cereal 

20 beta-D-glucans particularly at acidic conditions. In the present context a acid cellulase is de- 
fined as enzyme, which endohydrolyzes 1,4-beta-D-glucosidic linkages in cellulose, particularly 
at acidic conditions. 

SEQ ID NO: 27 aspartyl protease 

In a particular embodiment the polypeptide of the invention is an aspartyl protease comprising 
25 or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with an aspartyl protease ob- 
tainable from Alicyclobacillus sp., in particular that strain of Alicycfobacilfus sp. Deposited un- 
der DSM accession No. 15716, more particularly the mature aspartyl protease comprised in 
30 SEQ ID NO: 27. More specifically the mature aspartyl protease comprise or consists of the se- 
quences from position 33 to 272 of SEQ ID NO: 27. In the present context an aspartyl protease 
is defined as defined as an enzyme, which hydrolyses proteins or peptides, and which contains 
two aspartyl residues in the catalytic site. 
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Surprisingly, the aspartyl protease comprised in SEQ ID NO: 27 is the first isolated aspartyl 
protease, in particular family A4 protease, isolated from a bacterium. Furthermore, surprisingly 
this aspartyl protease differs from known fungal aspartyl proteases by the absence of 
disulphide bridges in the molecule. The aspartyl protease comprised in SEQ ID NO: 27 
5 contains only one Cystein and thus no disulphide bridges in the protease structure as 
compared to e.g. SEQ ID NO: 55 dislosing a known fungal aspartyl protease, which are 
composed of two peptides crosslinked by 2 disulphide bridges. Hence, the aspartyl protease of 
Alicyclobacillus sp. specifically that deposited under DSM accession No. 15716 a second 
propeptide is missing and thus requires one less maturation step less in its production. This is 
10 an advantage for the cellular production. Family A4 proteases are known to the art as 
proteases having a Asp/Asp Asp/Glu configuration in the active site. 

SEQ ID NO: 28 or SEQ ID NO:35 Multi copper oxidase 

In a particular embodiment the polypeptide of the invention is a multi copper oxidase compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 

15 more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with multi copper oxidase ob- 
tainable from Alicyclobaccillus sp., in particular that strain of Alicyclobacillus sp. Deposited un- 
der DSM accession No. 15716, more particularly the mature multi copper oxidase comprised in 
SEQ ID NO: 28 or 35. More specifically the mature multi copper oxidase comprises or consists 

20 of the sequences from position 26 to 315 of SEQ ID NO: 28 or position 50 to 597 of SEQ ID 
NO: 35. In the present context a multi-Cu-oxidase is defined as a protein, which possesses at 
least three spectroscopically different copper centers. Multicopper oxidases can be laccases, 
that oxidizes many different types of phenols and diamines, ascorbate oxidases, ceruloplas- 
min, that oxidizes a great variety of inorganic and organic substances or part of proteins that 

25 have lost the ability to bind copper and thereby mediate heavy metal resistance by sequestra- 
tion of the heavy metal in the periplasm of the bacterium. 

SEQ ID NO: 29 or SEQ ID NO: 30 Serine-carboxvl protease 

In a particular embodiment the enzyme of the invention is a serine-carboxyl protease compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 

30 more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with the serine-carboxyl pro- 
tease obtainable from Alicyclobaccillus sp., in particular that strain of Alicyclobacillus sp. De- 
posited under DSM accession No. 15716, more particularly the mature serine-carboxyl prote- 
ase comprised in SEQ ID NO: 29 or 30. More specifically the mature serine-carboxyl protease 

35 comprises or consists of the sequences from position 190 to 626 of SEQ ID NO: 29 or position 
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25 to 533 of SEQ ID NO: 30. In the present context a serine-carboxyl protease is defined as a 
protease belonging to the Enzyme class EC 3.4.21.100 (pseudomonapepsin) which proteolytic 
enzymes fold resembles that of subtilisin, with a unique catalytic triad, Ser-Glu-Asp, as well as 
the presence of an aspartic acid residue in the oxyanion hole. A polypeptide sequence can be 
5 classified as a serine-carboxyl peptidase, if the amino acids of the catalytic site are present in 
the sequence and if it shows peptide sequence similarity to peptide sequences in MEROPS 
serine protease family 53. 

SEQ ID NO: 31 serine protease or a HtrA-like serine protease 

In a particular embodiment the polypeptide of the invention is a serine protease or a HtrA-like 
10 serine protease comprising or consisting of an amino acid sequence which has at least 90%, 
particularly at least 95%, more particularly at least 96%, more particularly at least 97%, more 
particularly at least 98%, more particularly at least 99% or most particularly 100% identity with 
the serine protease or the HtrA-like serine protease obtainable from Alicyclobaccillus sp., in 
particular that strain of Alicyclobacillus sp. Deposited under DSM accession No. 15716, more 
15 particularly the mature serine protease arboxyl protease comprised in SEQ ID NO: 31. More 
specifically the mature serine protease comprises or consists of the sequences from position 
42 to 41 1 of SEQ ID NO: 31. In the present context a serine protease is defined as an enzyme, 
which hydrolyses proteins or peptides, and which contains a serine residue in the catalytic site. 
A HtrA-like protease is defined as an enzyme that degrades damaged proteins in the extracel- 
20 lular compartment of a bacterial cell at elevated temperatures. 

SEQ ID NO: 32 disulfide isomerase 

In a particular embodiment the polypeptide of the invention is a disulfide isomerase comprising 
or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 

25 more particularly at least 99% or most particularly 100% identity with the disulfide isomerase 
obtainable from Alicyclobaccillus sp., in particular that strain of Alicyclobacillus sp. Deposited 
under DSM accession No. 15716, more particularly the mature disulfide isomerase comprised 
in SEQ ID NO: 32. More specifically the mature disulfide isomerase comprises or consists of 
the sequences from position 31 to 212 of SEQ ID NO: 32. In the present context a disulphide 

30 isomerase is defined as enzyme, which catalyses the rearrangement of both intrachain and 
interchain disulfide bonds in proteins to form the native structures. 

SEQ ID NO:33 qamma-D-qlutamyl-L-diamino acid endopeptidase 

In a particular embodiment the polypeptide of the invention is a gamma-D-glutamyl-L-diamino 
acid endopeptidase comprising or consisting of an amino acid sequence which has at least 
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90%, particularly at least 95%, more particularly at least 96%, more particularly at least 97%, 
more particularly at least 98%, more particularly at least 99% or most particularly 100% identity 
with the gamma-D-glutamyl-L-diamino acid endopeptidase obtainable from Alicyclobaccillus 
sp., in particular that strain of Alicyciobacillus sp. Deposited under DSM accession No. 15716, 
5 more particularly the mature gamma-D-glutamyl-L-diamino acid endopeptidase comprised in 
SEQ ID NO: 33. More specifically the mature gamma-D-glutamyl-L-diamino acid endopepti- 
dase comprises or consists of the sequences from position 30 to 266 of SEQ ID NO: 33. In the 
present context a gamma-D-glutamyl-L-diamino acid endopeptidase is defined as an enzyme 
that hydrolyses gamma-D-glutamyl bonds to (L) meso-diaminopimelic acid in L-Ala-gamma-D- 
10 Glu-|-(L)meso-diaminopimelic acid-(L)-D-Ala. It is required that the omega-amino and omega- 
carboxyl groups of the (L) meso- diaminopimelic acid group are unsubstituted. 

SEQ ID NO: 34 endo-beta-N-acetylgfucosaminidase 

In a particular embodiment the polypeptide of the invention is an endo-beta-N- 
acetylglucosaminidase comprising or consisting of an amino acid sequence which has at least 

15 90%, particularly at least 95%, more particularly at least 96%, more particularly at least 97%, 
more particularly at least 98%, more particularly at least 99% or most particularly 100% identity 
with the endo-beta-N-acetylglucosaminidase obtainable from Alicyclobaccillus sp., in particular 
that strain of Alicyciobacillus sp. Deposited under DSM accession No. 15716, more particularly 
the mature endo-beta-N-acetylglucosaminidase comprised in SEQ ID NO: 3 4. More s pecifi- 

20 cally the mature endo-beta-N-acetylglucosaminidase comprises or consists of the sequences 
from position 27 to 768 of SEQ ID NO: 34. In the present context an endo-beta-N- 
Acetylglucosaminidase is defined as enzyme, that hydrolyses the 1,4-beta-linkages between 
N-acetyl-D-glucosamine and N-acetylmuramic acid in peptidoglycan heteropolymers of the 
prokaryotes cell walls. 

25 SEQ ID NO:36.peptidyl-prolvl-isomerase 

In a particular embodiment the polypeptide of the invention is a peptidyl-prolyl-isomerase com- 
prising or consisting of an amino acid sequence which has at least 90%, particularly at least 
95%, more particularly at least 96%, more particularly at least 97%, more particularly at least 
98%, more particularly at least 99% or most particularly 100% identity with the peptidyl-prolyl- 
30 isomerase obtainable from Alicyclobaccillus sp., in particular that strain of Alicyciobacillus sp. 
Deposited under DSM accession No. 15716, more particularly the mature peptidyl-proJyl- 
isomerase comprised in SEQ ID NO: 36. More specifically the mature peptidyl-prolyl- 
isomerase comprises or consists of the sequences from position 30 to 246 of SEQ ID NO: 36. 
In the present context a peptidyl-prolyl-isomerase is defined as an enzyme that accelerates 
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protein folding by catalyzing the cts-trans isomerization of proline imidic peptide bonds in oli- 
gopeptides. 

SEQ ID NO: 37 acid phosphatase or a phvtase or a phospholipase C 

In a particular embodiment the polypeptide of the invention is an acid phosphatase or a phy- 
5 tase or a phospholipase C comprising or consisting of an amino acid sequence which has at 
least 90%, particularly at least 95%, more particularly at least 96%, more particularly at least 
97%, more particularly at least 98%, more particularly at least 99% or most particularly 100% 
identity with the acid phosphatase or phytase or phospholipase C obtainable from Alicyclobac- 
cillus sp., in particular that strain of Alicyclobacillus sp. Deposited under DSM accession No. 

10 15716, more particularly the mature acid phosphatase or phytase or phospholipase C com- 
prised in SEQ ID NO: 37. More specifically the mature acid phosphatase or a phytase or a 
phospholipase C comprises or consists of the sequences from position 28 to 608 of SEQ ID 
NO: 37. An acid phosphatase is defined as enzyme hydrolyzing an orthophosphoric monoester 
into an alcohol and phosphate. In the present context a phytase is defined as an enzyme re- 

15 moving removing a phosphate group from phytate. A phospholipase C is defined as an en- 
zyme hydrolysing phosphatidylcholine into 1 ,2-diacylglycerol and choline. 

SEQ ID NO: 38 or SEQ ID NO: 39 polysaccharide deacetvlase 

In a particular embodiment the polypeptide of the invention is a polysaccharide deacetylase or 
a xylan deacetylase comprising or consisting of an amino acid sequence which has at least 

20 90%, particularly at least 95%, more particularly at least 96%, more particularly at least 97%, 
more particularly at least 98%, more particularly at least 99% or most particularly 1 00% identity 
with the polysaccharide deacetylase or the xylan deacetylase obtainable from Alicyclobaccillus 
sp., in particular that strain of Alicyclobacillus sp. Deposited under DSM accession No. 15716, 
more particularly the mature polysaccharide deacetylase or a xylan deacetylase comprised in 

25 SEQ IDNO:38or39, M ore s pecifically t he m ature p olysaccharide d eacetylase o r a xylan 
deacetylase comprises or consists of the sequences from position 26 to 251 of SEQ ID NO: 38 
or position 22 to 324 of SEQ ID NO: 39. In the present context a polysaccharide deacetylase is 
defined as an enzyme, which removes acetyl residues from a specific acetyled polysaccharide 
by hydrolysis. A xylan deactylase is defined as an enzyme removing acetyl groups from acety- 

30 lated xylan. 

SEQ ID NO:40 sulfite oxidase 

In a particular embodiment the polypeptide of the invention is a sulfite oxidase comprising or 
consisting of an amino acid sequence which has at least 90%, particularly at least 95%, more 
particularly at least 96%, more particularly at least 97%, more particularly at least 98%, more 

20 
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particularly at least 99% or most particularly 100% identity with the sulfite oxidase obtainable 
from Alicyclobacciltus sp., in particular that strain of Aficyclobacillus sp. Deposited under DSM 
accession No. 15716, more particularly the mature sulfite oxidase comprised in SEQ ID NO: 
40. More specifically the mature sulfite oxidase comprises or consists of the sequences from 
5 position 30 to 214 of SEQ ID NO: 40. A sulfite oxidase is defined as enzyme that oxidizes sul- 
fite to sulfate. 

SEQ ID NO:41 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
10 more particularly at least 96%, more particularly at least 97%, more particularly at least 98% t 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:41. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 41. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 22 to 
257 of SEQ ID NO: 41. 

15 SEQ ID NO:42 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:42. In par^ 

20 ticular with the mature functional polypeptide comprised in SEQ ID NO: 42. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 25 to 
1130 of SEQ ID NO: 42. 

SEQ ID NO:43 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
25 ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:43. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 43. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 42 to 
30 248 of SEQ ID NO: 43. 

SEQ ID NO:44 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
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more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:44. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 44. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 26 to 
5 172 of SEQ ID NO: 44. 

SEQ ID NO:45 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%. more particularly at least 97%, more particularly at least 98%, 
10 more particularly at least 99% or most particularly 100% identity with SEQ ID NO:45. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 45. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 31 to 
242 of SEQ ID NO: 45. 

SEQ ID NO:46 functional polypeptide 

15 In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:46. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 46. More specifically 

20 the mature functional polypeptide comprises or consists of the sequences from position 25 to 
280 of SEQ ID NO: 46. 

SEQ ID NO:47 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
25 more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO:47. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 47. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 26 to 
478 of SEQ ID NO: 47. 

30 SEQ ID NO:48 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
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more particularly at least 99% or most particularly 100% identity with SEQ ID NO: 48. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 48. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 20 to 

340 of SEQ ID NO: 48. 

5 SEQ ID NO:49 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97%, more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO: 49. In par- 
10 ticular with the mature functional polypeptide comprised in SEQ ID NO: 49. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 30 to 

341 of SEQ ID NO: 49. 

SEQ ID NO: 50 functional polypeptide 

In a particular embodiment the polypeptide of the invention is a functional polypeptide compris- 
15 ing or consisting of an amino acid sequence which has at least 90%, particularly at least 95%, 
more particularly at least 96%, more particularly at least 97% t more particularly at least 98%, 
more particularly at least 99% or most particularly 100% identity with SEQ ID NO: 50. In par- 
ticular with the mature functional polypeptide comprised in SEQ ID NO: 50. More specifically 
the mature functional polypeptide comprises or consists of the sequences from position 29 to 
20 400 of SEQ ID NO: 50. 

Polynucleotides 

The present invention also relates to polynucleotides comprising or consisting of a nu- 
cleotide sequence encoding a polypeptide of the invention. In a particular embodiment, the nu- 

25 cleotide sequence is set forth in SEQ ID NO: 1 to SEQ ID NO: 25 including nucleotide se- 
quences differing there from by virtue of the degeneracy of the genetic code. In a further em- 
bodiment the polynucleotide of the invention is a modified nucleotide sequence which com- 
prises or consists of a nucleotide sequence selected from the regions of SEQ ID NO: 1 to SEQ 
ID NO: 25 encoding a mature polypeptide and which comprises at least one modifica- 

30 tion/mutation compared with the parent nucleotide sequence comprised in SEQ ID NO: 1 to 
SEQ ID NO: 25. 

The techniques used to isolate and/or done a nucleotide sequence encoding an en- 
zyme are known in the art and include isolation from genomic DNA, preparation from cDNA, or 
a combination thereof. The cloning of the nucleotide sequences of the present invention from 
35 such genomic DNA can be effected, e.g., by using the well known polymerase chain reaction 
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(PCR) or antibody screening of expression libraries to detect cloned DNA fragments with 
shared structural features. See, e.g., Innis et a/.. 1990, PCR: A Guide to Methods and Applica- 
tion, Academic Press, New York. Other amplification procedures such as ligase chain reaction 
(LCR), ligated activated transcription (LAT) and nucleotide sequence-based amplification 
5 (NASBA) may be used. 

The nucleotide sequence may be obtained by standard cloning procedures used in ge- 
netic engineering to relocate the nucleotide sequence from its natural location to a different site 
where it will be reproduced. The cloning procedures may involve excision and isolation of a 
desired fragment comprising the nucleotide sequence encoding the polypeptide, insertion of 
10 the fragment into a vector molecule, and incorporation of the recombinant vector into a host 
cell where multiple copies or clones of the nucleotide sequence will be replicated. The nucleo- 
tide sequence may be of genomic, cDNA, RNA, semi-synthetic, synthetic origin, or any 
combinations thereof. 

In particular the polynucleotide comprises, preferably consists of, a nucleotide se- 
15 quence which has at least 50% identity with a nucleotide sequence selected from the regions 
of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide. Particularly, the nucleotide 
sequence has at least 65% identity, more particularly at least 70% identity, more particularly at 
least 80% identity, more particularly at least 90% identity, more particularly at least 95% iden- 
tity, more particularly at least 96% identity, more particularly at least 97% identity, more par- 
20 ticularly at least 98% identity, more particularly at least 99% identity or most particularly 100% 
identity with a nucleotide sequence selected from the regions of SEQ ID NO: 1 to SEQ ID NO: 
25 encoding a mature polypeptide. Particularly, the nucleotide sequence comprises a nucleo- 
tide sequence selected from the regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a ma- 
ture polypeptide. In an even more particular embodiment, the nucleotide sequence consists of 
25 a nucleotide sequence selected from the regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding 
a mature polypeptide. 

In particlar the polynucleotide comprises, preferably consists of, a nucleotide sequence 
which has at least 50% identity, particularly at least 65% identity, more particularly at least 
70% identity, more particularly at least 80% identity, more particularly at least 90% identity, 

30 more particularly at least 95% identity, more particularly at least 96% identity, more particularly 
at least 97% identity, more particularly at least 98% identity, more particularly at least 99% 
identity or most particularly 100% identity with a nucleotide sequence encoding a mature en- 
zyme selected from acid endoglucanase or acid cellulase, aspartyl protease, multi copper oxi- 
dase, serine-carboxyl protease, serine protease or HtrA-like serine protease, disulfide isom- 

35 erase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, 
peptidyl-prolyl-isomerase, acid phosphatase or phytase or phospholipase C, polysaccharide 
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deacetylase or xylan deacetylase and sulfite oxidase secreted from the strain of Alicyclobacil- 
tus sp. Deposited under DSM accession No. 15716 

SEQ ID NO: 1 

In a particular embodiment the polynucleotide of the invention encodes an acid endoglucanase 
5 or acid cellulase and comprises or consists of an nucleotide sequence which has at least 70% 
identity, more particularly at least 80% identity, more particularly at least 90% identity, more 
particularly at least 95% identity, more particularly at least 96% identity, more particularly at 
least 97% identity, more particularly at least 98% identity, more particularly at least 99% iden- 
tity or most particularly 100% identity with the nucleotide sequence of position 73 to 2877of 
10 SEQ ID NO: 1 

SEQ ID NO: 2 

In a particular embodiment the polynucleotide of the invention encodes an aspartyl protease 
and comprises or consists of an nucleotide sequence which has at least 70% identity, more 
particularly at least 80% identity, more particularly at least 90% identity, more particularly at 
15 least 95% identity, more particularly at least 96% identity, more particularly at least 97% iden- 
tity, more particularly at feast 98% identity, more particularly at least 99% identity or most par- 
ticularly 100% identity with the nucleotide sequence of position 97 to 816 of SEQ ID NO: 2 

SEQ ID NO: 3 and 10 

In a particular embodiment the polynucleotide of the invention encodes an multi copper oxi- 
20 dase and comprises or consists of an nucleotide sequence which has at least 70% identity, 
more particularly at least 80% identity, more particularly at least 90% identity, more particularly 
at least 95% identity, more particularly at least 96% identity, more particularly at least 97% 
identity, more particularly at least 98% identity, more particularly at least 99% identity or most 
particularly 100% identity with the nucleotide sequence of position 76 to 945 of SEQ ID NO: 1 
25 or 148 to 1791 of SEQ ID NO: 10 

SEQ ID NO: 4 and 5 

In a particular embodiment the polynucleotide of the invention encodes a serine-carboxyl pro- 
tease and comprises or consists of an nucleotide sequence which has at least 70% identity, 
more particularly at least 80% identity, more particularly at least 90% identity, more particularly 
30 at least 95% identity, more particularly at least 96% identity, more particularly at least 97% 
identity, more particularly at least 98% identity, more particularly at least 99% identity or most 
particularly 100% identity with the nucleotide sequence of position 568 to 1878 of SEQ ID NO: 
4 or 73 to 1599 of SEQ ID NO: 5. 
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SEQ ID NO: 6 

In a particular embodiment the polynucleotide of the invention encodes a serine protease or a 
HtrA-like serine protease and comprises or consists of an nucleotide sequence which has at 
least 70% identity, more particularly at least 80% identity, more particularly at least 90% iden- 
5 tity, more particularly at least 95% identity, more particularly at least 96% identity, more par- 
ticularly at least 97% identity, more particularly at least 98% identity, more particularly at least 
99% identity or most particularly 100% identity with the nucleotide sequence of position 124 to 
1233 of SEQ ID NO: 6, 

SEQ ID NO: 7 

10 In a particular embodiment the polynucleotide of the invention encodes a disulfide isomerase 
and comprises or consists of an nucleotide sequence which has at least 70% identity, more 
particularly at least 80% identity, more particularly at least 90% identity, more particularly at 
least 95% identity, more particularly at least 96% identity, more particularly at least 97% iden- 
tity, more particularly at least 98% identity, more particularly at least 99% identity or most par- 

15 ticularly 100% identity with the nucleotide sequence of position 91 to 633 of SEQ ID NO: 7. 

SEQ ID NO: 8 

In a particular embodiment the polynucleotide of the invention encodes a gamma-D-glutamyl- 
L-diamino acid endopeptidase and comprises or consists of an nucleotide sequence which has 
at least 70% identity, more particularly at least 80% identity, more particularly at least 90% 
20 identity, more particularly at least 95% identity, more particularly at least 96% identity, more 
particularly at least 97% identity, more particularly at least 98% identity, more particularly at 
least 99% identity or most particularly 100% identity with the nucleotide sequence of position 
88 to 798 of SEQ ID NO: 8. 

SEQ ID NO: 9 

25 In a particular embodiment the polynucleotide of the invention encodes a endo-beta-N- 
acetylglucosaminidase and comprises or consists of an nucleotide sequence which has at 
least 70% identity, more particularly at least 80% identity, more particularly at least 90% iden- 
tity, more particularly at least 95% identity, more particularly at least 96% identity, more par- 
ticularly at least 97% identity, more particularly at least 98% identity, more particularly at least 

30 99% identity or most particularly 100% identity with the nucleotide sequence of position 79 to 
2304 of SEQ ID NO: 9. 



26 



1 0406.01 0-DK 



SEQ ID NO: 1 1 

In a particular embodiment the polynucleotide of the invention encodes a peptidyl-prolyl- 
isomerase and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
5 ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 88 to 735 of SEQ ID 
NO: 9. 

SEQ ID NO: 12 

10 In a particular embodiment the polynucleotide of the invention encodes a acid phosphatase or 
a phytase or a phospholipase C and comprises or consists of an nucleotide sequence which 
has at least 70% identity, more particularly at least 80% identity, more particularly at least 90% 
identity, more particularly at least 95% identity, more particularly at least 96% identity, more 
particularly at least 97% identity, more particularly at least 98% identity, more particularly at 

15 least 99% identity or most particularly 100% identity with the nucleotide sequence of position 
82 to 1824 of SEQ ID NO: 12. 

SEQ ID NO: 13 and 14 

In a particular embodiment the polynucleotide of the invention encodes a polysaccharide 
deacetylase or a xylan deacetylase and comprises or consists of an nucleotide sequence 
20 which has at least 70% identity, more particularly at least 80% identity, more particularly at 
least 90% identity, more particularly at least 95% identity, more particularly at least 96% iden- 
tity, more particularly at least 97% identity, more particularly at least 98% identity, more par- 
ticularly at least 99% identity or most particularly 100% identity with the nucleotide sequence of 
position 76 to 750 of SEQ ID NO: 13 or position 64 to 972 of SEQ ID NO: 14. 

25 SEQ ID NO: 15 

In a particular embodiment the polynucleotide of the invention encodes a sulfite oxidase and 
comprises or consists of an nucleotide sequence which has at least 70% identity, more particu- 
larly at least 80% identity, more particularly at least 90% identity, more particularly at least 95% 
identity, more particularly at least 96% identity, more particularly at least 97% identity, more 
30 particularly at least 98% identity, more particularly at least 99% identity or most particularly 
100% identity with the nucleotide sequence of position 88 to 642 of SEQ ID NO: 15, 
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SEQ ID NO: 16 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
5 ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 64 to 771 of SEQ ID 
NO: 16. 

SEQ ID NO: 17 

10 In a particular embodiment the polynucleotide of the invention encodes mature functional poly- 
peptide and comprises or consists of an nucleotide sequence which has at least 70% identity, 
more particularly at least 80% identity, more particularly at least 90% identity, more particularly 
at least 95% identity, more particularly at least 96% identity, more particularly at least 97% 
identity, more particularly at least 98% identity, more particularly at least 99% identity or most 

15 particularly 100% identity with the nucleotide sequence of position 73 to 3390 of SEQ ID NO: 
17. 

SEQ ID NO: 18 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
20 tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 124 to 744 of SEQ ID 
NO:. 18. 

25 SEQ ID NO: 19 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
30 97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 76 to 516 of SEQ ID 
NO: 19. 
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SEQ ID NO: 20 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
5 ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 91 to 726 of SEQ ID 
NO: 20. 

SEQ ID NO: 21 

10 In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 

15 most particularly 100% identity with the nucleotide sequence of position 73 to 540 of SEQ ID 
NO: 21. 

SEQ ID NO: 22 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
20 tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 76 to 1431 of SEQ ID 
NO: 22. 

25 SEQ ID NO: 23 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
30 97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 58 to 1020 of SEQ ID 
NO: 23. 
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SEQ ID NO: 24 

In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
5 ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 
most particularly 100% identity with the nucleotide sequence of position 88 to 1023 of SEQ ID 
NO: 24. 

SEQ ID NO. 25 

10 In a particular embodiment the polynucleotide of the invention encodes a mature functional 
polypeptide and comprises or consists of an nucleotide sequence which has at least 70% iden- 
tity, more particularly at least 80% identity, more particularly at least 90% identity, more par- 
ticularly at least 95% identity, more particularly at least 96% identity, more particularly at least 
97% identity, more particularly at least 98% identity, more particularly at least 99% identity or 

15 most particularly 100% identity with the nucleotide sequence of position 85 to 1 197 of SEQ ID 
NO: 25. 

Modification of a nucleotide sequence encoding a polypeptide of the present invention may be 
necessary for the synthesis of a polypeptde which comprises an amino acid sequence that has 

20 at least one substitution, deletion and/or insertion as compared to an amino acid sequence se- 
lected from mature polypeptide comprised in SEQ ID NO: 26 to SEQ ID NO: 50. 

It will be apparent to those skilled in the art that such modifications can be made to 
preserve the function of the enzyme i.e. made outside regions critical to the function of the en- 
zyme. Amino acid residues which are essential to the function are therefore preferably not sub- 

25 ject to modification, such as substitution. Amino acid residues essential to the function may be 
identified according to procedures known in the art, such as site-directed mutagenesis or 
alanine-scanning mutagenesis (see, e.g., Cunningham and Wells, 1989, Science 244: 1081- 
1085). Sites of substrate-enzyme interaction can be determined by analysis of the three- 
dimensional structure as determined by such techniques as nuclear magnetic resonance 

30 analysis, crystallography or photoaffinity labelling (see, e.g., de Vos et a/., 1992, Science 255: 
306-312; Smith et a/., 1992, Journal of Molecular Biology 224: 899-904; Wlodaver et a/., 1992, 
FEBS Letters 309: 59-64). 

Moreover, a nucleotide sequence encoding an enzyme of the invention may be modi- 
fied by introduction of nucleotide substitutions which do not give rise to another amino acid se- 

35 quence of the enzyme encoded by the nucleotide sequence, but which correspond to the 

codon usage of the host organism intended for production of the enzyme. 

30 
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The introduction of a mutation into the nucleotide sequence to exchange one nucleo- 
tide for another nucleotide may be accomplished by site-directed mutagenesis using any of the 
methods known in the art. Particularly useful is the procedure, which utilizes a supercoiled, 
double stranded DNA vector with an insert of interest and two synthetic primers containing the 
5 desired mutation. The oligonucleotide primers, each complementary to opposite strands of the 
vector, extend during temperature cycling by means of Pfu DNA polymerase. On incorporation 
of the primers, a mutated plasrnid containing staggered nicks is generated; Following tem- 
perature cycling, the product is treated with Dpn\, which is specific for methylated and hemi- 
methylated DNA to digest the parental DNA template and to select for mutation-containing syn- 
10 thesized DNA. Other procedures known in the art may also be used. For a general description 
of nucleotide substitution, one may consult with e.g., Ford et a/., 1991, Protein Expression and 
Purification 2: 95-107. 

The present invention also relates to a polynucleotide comprising, preferably consisting 
of, a nucleotide sequence which encodes a polypeptide of the invention and which hybridizes 
15 under high stringency conditions, preferably under very high stringency conditions with a 
polynucleotide probe selected from the group consisting of: 

(i) the complementary strand to a nucleotide sequence selected from the regions of SEQ 
ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide , 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide sequences 
20 selected from the regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature poly- 
peptide and, 

(iii) a fragment of (i) or (ii) encoding a secreted mature polypeptide having the function of 
the corresponding mature polypeptides comprised in SEQ ID NO:26 to SEQ ID NO:50 

25 (J. Sambrook, E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual, 
2d edition, Cold Spring Harbor, New York). 

As will be understood, details and particulars concerning hybridization of the nucleotide se- 
quences will be the same or analogous to the hybridization aspects discussed in the section 

30 titled "polypeptides of the invention" herein. 

The present invention also encompasses a storage medium suitable for use in an elec- 
tronic device comprising information of the amino acid sequence of polypeptides of the inven- 
tion or the nucleotide sequences of the polynucleotide of the invention. The storage medium 
may suitably be a magnetic or optical disk and the electronic device a computing device and 

35 the information may in particular be stored on the storage medium in a digital form. 
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Nucleotide constructs 

The present invention also relates to nucleic acid constructs comprising a nucleotide 
sequence of the invention operably linked to one or more control sequences that direct the ex- 
pression of the coding sequence in a suitable host cell under conditions compatible with the 
5 control sequences. 

A nucleotide sequence encoding an enzyme of the invention may be manipulated in a 
variety of ways to provide for expression of the enzyme. Manipulation of the nucleotide se- 
quence prior to its insertion into a vector may be desirable or necessary depending on the ex- 
pression vector The techniques for modifying nucleotide sequences utilizing recombinant 
10 DNA methods are well known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleotide se- 
quence that is recognized by a host cell for expression of the nucleotide sequence. The pro- 
moter sequence contains transcriptional control sequences, which mediate the expression of 
the polypeptide. The promoter may be any nucleotide sequence which shows transcriptional 
15 activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may 
be obtained from genes encoding extracellular or intracellular polypeptides either homologous 
or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid con- 
structs of the present invention, especially in a bacterial host cell, are the promoters obtained 
20 from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis le- 
vansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus 
stearothermophilus maltogenic amylase gene (amy/tf), Bacillus amyloliquefaciens alpha- 
amylase gene {amyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA 
and xylB genes, and prokaryotic beta-lactamase gene (Villa-Kamaroff et al. t 1978, Proceed- 
25 ings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac promoter 
(DeBoer et a/., 1983, Proceedings of the National Academy of Sciences USA 80: 21-25). Fur- 
ther promoters are described in "Useful proteins from recombinant bacteria" in Scientific 
American, 1980, 242: 74-94; and in Sambrook et a/., 1989, supra. 

Examples of suitable promoters for directing the transcription of the nucleic acid con- 
30 structs of the present invention in a filamentous fungal host cell are promoters obtained from 
the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, As- 
pergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus 
niger or Aspergillus awamori glucoamylase (glaA) t Rhizomucor miehei lipase, Aspergillus 
oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans 
35 acetamidase, and Fusarium oxysporum trypsin-like protease (WO 96/00787), as well as the 
NA2-tpi promoter (a hybrid of the promoters from the genes for Aspergillus niger neutral alpha- 
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amylase and Aspergillus oryzae triose phosphate isomerase), and mutant, truncated, and hy- 
brid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Saccharomyces cer- 
evisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces 
5 cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP), 
and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful promoters for yeast 
host cells are described by Romanos et a/., 1992 t Yeast 8: 423-488. 

The control sequence may also be a suitable transcription terminator sequence, a se- 
quence recognized by a host cell to terminate transcription. The terminator sequence is oper- 
10 ably linked to the 3' terminus of the nucleotide sequence encoding the enzyme. Any termina- 
tor which is functional in the host cell of choice may be used in the present invention. 

Preferred terminators for filamentous fungal host cells are obtained from the genes for 
Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidufans an- 
thranilate synthase, Aspergillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like 
15 protease. 

Preferred terminators for yeast host cells are obtained from the genes for Saccharomy- 
ces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomy- 
ces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for 
yeast host cells are described by Romanos et a/., 1 992, supra. 

20 The control sequence may also be a suitable leader sequence, a non-translated region 

of an mRNA which is important for translation by the host cell. The leader sequence is opera- 
bly linked to the 5' terminus of the nucleotide sequence encoding the polypeptide. Any leader 
sequence that is functional in the host cell of choice may be used in the present invention. 

Preferred leaders for filamentous fungal host cells are obtained from the genes forV\s- 

25 pergillus oryzae TAKA amylase and Aspergillus nidufans triose phosphate isomerase. 

Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces 
cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Sac- 
charomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydro- 
genase/glyceratdehyde-3-phosphate dehydrogenase (ADH2/GAP). 

30 The control sequence may also be a polyadenylation sequence, a sequence operably 

linked to the 3' terminus of the nucleotide sequence and which, when transcribed, is recog- 
nized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any 
polyadenylation sequence which is functional in the host cell of choice may be used in the pre- 
sent invention. 

35 Preferred polyadenylation sequences for filamentous fungal host cells are obtained 

from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Asper- 
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gillus nidulans anthranilate synthase, Fusahum oxysporum trypsin-like protease, and Aspergil- 
lus niger alpha-glucosidase. 

Useful polyadenytation sequences for yeast host cells are described by Guo and 
Sherman, 1995, Molecular Cellular Biology 1 5: 5983-5990. 
5 The control sequence may also be a signal peptide coding region that codes for an 

amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded 
enzyme into the cell's secretory pathway. The 5' end of the coding sequence of the nucleotide 
sequence may inherently contain a signal peptide coding region naturally linked in translation 
reading frame with the segment of the coding region which encodes the secreted enzyme. Al- 
io ternatively, the 5' end of the coding sequence may contain a signal peptide coding region 
which is foreign to the coding sequence. The foreign signal peptide coding region may be re- 
quired where the coding sequence does not naturally contain a signal peptide coding region. 
Alternatively, the foreign signal peptide coding region may simply replace the natural signal 
peptide coding region in order to enhance secretion of the enzyme. However, any signal pep- 
15 tide coding region which directs the expressed enzyme into the secretory pathway of a host 
cell of choice may be used in the present invention. 

Effective signal peptide coding regions for bacterial host cells are the signal peptide 
coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus 
stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta- 
20 lactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus 
subtifis prsA. Further signal peptides are described by Simonen and Palva, 1993, Microbi- 
ological Reviews 57: 109-1 37. 

Effective signal peptide coding regions for filamentous fungal host cells are the signal 
peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Asper- 
25 gillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehei aspartic pro- 
teinase, Humicola insolens cellulase, and Humicola lanuginosa lipase. 

Useful signal peptides for yeast host cells are obtained from the genes for Saccharo- 
myces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal 
peptide coding regions are described by Romanos et a/., 1992, supra. 
30 The control sequence may also be a propeptide coding region that codes for an amino 

acid sequence positioned at the amino terminus of a enzyme. The resultant polypeptide may 
be denoted a pro-enzyme or propolypeptide. A propolypeptide is generally inactive and can be 
converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the propep- 
tide from the propolypeptide. The propeptide coding region may be obtained from the genes 
35 for Bacillus subtilis alkaline protease {aprE), Bacillus subtilis neutral protease (nprT), Sac- 
charomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and My- 
celiophthora thermophila laccase (WO 95/33836. 
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Where both signal peptide and propeptide regions are present at the amino terminus of 
a polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide 
and the signal peptide region is positioned next to the amino terminus of the propeptide region. 
In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the TAKA al- 
5 pha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus oryzae glu- 
coamylase promoter may be used as regulatory sequences. 

Other examples of regulatory sequences are those which allow for gene amplification. In 
eukaryotic systems, these include the dihydrofolate reductase gene which is amplified in the 
presence of methotrexate, and the metallothionein genes which are amplified with heavy met- 
10 als. I n these cases, the nucleotide sequence e needing the polypeptide would be operably 
linked with the regulatory sequence. 

Recombinant expression vectors 

The present invention also relates to recombinant expression vectors comprising the 

15 nucleic acid construct of the invention. The various nucleotide and control sequences de- 
scribed above may be joined together to produce a recombinant expression vector, which may 
include one or more convenient restriction sites to allow for insertion or substitution of the nu- 
cleotide sequence encoding the polypeptide at such sites. Alternatively, the nucleotide se- 
quence of the present invention may be expressed by inserting the nucleotide sequence or a 

20 nucleic acid construct comprising the sequence into an appropriate vector for expression. In 
creating the expression vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) that 
can be conveniently subjected to recombinant DNA procedures and can bring about the ex- 

25 pression of the nucleotide sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell into which the vector is to be introduced. The vec- 
tors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, /.e. f a vector that exists as an 
extrachromosomal entity, the replication of which is independent of chromosomal replication, 

30 e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromo- 
some. 

The vector may contain any means for assuring self-replication. Alternatively, the vec- 
tor may be one which, when introduced into the host cell, is integrated into the genome and 
replicated together with the chromosome(s) into which it has been integrated. Furthermore, a 
35 single vector or plasmid or two or more vectors or plasmids which together contain the total 
DNA to be introduced into the genome of the host cell, or a transposon may be used. 
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The vectors of the present invention preferably contain one or more selectable markers 
that permit easy selection of transformed cells. A selectable marker is a gene the product of 
which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the like. 

Examples o f b acterial s electable m arkers a re the d al g enes from B acitlus s ubtilis o r 
Bacillus licheniformis, or markers that confer antibiotic resistance such as ampicillin, kanamy- 
cin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells are 
ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in a filamen- 
tous fungal host cell include, but are not limited to, amdS (acetamidase), argB (ornithine car- 
bamoyltransferase), bar (phosphinothricin acetyltransferase), hygB (hygromycin phosphotrans- 
ferase), niaD (nitrate reductase), pyrG (orotidine-5-phosphate decarboxylase), sC (sulfate 
adenyltransferase), trpC (anthranilate synthase), as well as equivalents thereof. 
Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus nidufans 
or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

The vectors of the present invention preferably contain an element(s) that permits sta- 
ble integration of the vector into the host cell's genome or autonomous replication of the vector 
in the cell independent of the genome. 

For i ntegration i nto the host cell genome, the vector may rely on the nucleotide se- 
quence encoding the polypeptide or any other element of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous recombination. Alternatively, 
the vector may contain additional nucleotide sequences for directing integration by homolo- 
gous recombination into the genome of the host cell. The additional nucleotide sequences en- 
able the vector to be integrated into the host cell genome at a precise location(s) in the chro- 
mosome^). To increase the likelihood of integration at a precise location, the integrational 
elements should preferably contain a sufficient number of nucleotides, such as 100 to 1,500 
base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 base pairs, 
which are highly homologous with the corresponding target sequence to enhance the probabil- 
ity of homologous recombination. The integrational elements may be any sequence that is 
homologous with the target sequence in the genome of the host cell. Furthermore, the integra- 
tional elements may be non-encoding or encoding nucleotide sequences. On the other hand, 
the vector may be integrated into the genome of the host cell by non-homologous recombina- 
tion. 

For autonomous replication, the vector may further comprise an origin of replication 
enabling the vector to replicate autonomously in the host cell in question. Examples of bacte- 
rial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, 
and pACYC184 permitting replication in E. co//, and pUB110, pE194, pTA1060, and pAMIJI 

permitting replication in Bacillus. Examples of origins of replication for use in a yeast host cell 
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are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and 
the combination of ARS4 and CEN6. 

The o rigin of replication m ay b e o ne h aving a m utation which makes i ts functioning 
temperature-sensitive i n the host cell (see, e.g., E hrlich, 1 978, Proceedings of the National 
5 Academy of Sciences USA 75: 1433). 

More than one copy of a nucleotide sequence of the present invention may be inserted 
into the host cell to increase production of the gene product. An increase in the copy number 
of the nucleotide sequence can be obtained by integrating at least one additional copy of the 
sequence into the host cell genome or by including an amplifiable selectable marker gene with 
10 the nucleotide sequence where cells containing amplified copies of the selectable marker 
gene, and thereby additional copies of the nucleotide sequence, can be selected for by culti- 
vating the cells in the presence of the appropriate selectable agent. 

The procedures used to ligate the elements described above to construct the recombi- 
nant expression vectors of the present invention are well known to one skilled in the art {see, 
15 e.g., Sambrook et a/., 1989, supra). 

Recombinant host cells 

The present invention also relates to recombinant a host cell comprising the nucleic acid con- 
struct of the invention, which are advantageously used in the recombinant production of the 

20 polypeptides. A vector c omprising a n ucleotide s equence o f t he p resent i nvention is i ntro- 
duced into a host cell so that the vector is maintained as a chromosomal integrant or as a self- 
replicating extra-chromosomal vector as described earlier. 

The host cell may be a unicellular microorganism, e.g., a prokaryote or a non- 
unicellular microorganism, e.g., a eukaryote. 

25 Useful unicellular cells are bacterial cells such as gram positive bacteria including, but 

not limited to, a Bacillus cell, e.g., Bacillus aikalophilus, Bacillus amyloliquefaciens, Bacillus 
brevis, Bacillus circulans, Bacillus claush\ Bacillus coagulans. Bacillus lautus. Bacillus lentus, 
Bacillus Hcheniformis, Bacillus megaterium, Bacillus stearothermophilus. Bacillus subtilis, and 
Bacillus thuringiensis: or a Streptomyces cell, e.g., Streptomyces lividans or Streptomyces 

30 murinus, or gram negative bacteria such as E. coli and Pseudomonas sp. In a preferred em- 
bodiment, the bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus 
stearothermophilus t or Bacillus subtilis cell. In another preferred embodiment, the Bacillus cell 
is an alkalophilic Bacillus. 

The introduction of a vector into a bacterial host cell may, for instance, be effected by 

35 protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 
168: 111-115), using competent cells (see, e.g., Young and Spizizin, 1961, Journal of Bacterh 
ology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 
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209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), 
or conjugation {see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5771-5278). 
The host cell may be a eukaryote, such as a mammalian, insect, plant, or fungal cell. 
In a preferred embodiment, the host cell is a fungal cell. "Fungi" as used herein in- 
5 eludes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined 
by Hawksworth et al, In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition. 1995, 
CAB International, University Press, Cambridge, UK) as well as the Oomycota (as cited in 
Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et at., 1995, 
supra). In a more preferred embodiment, the fungal host cell is a yeast cell. "Yeast" as used 

10 herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and 
yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may 
change in the future, for the purposes of this invention, yeast shall be defined as described in 
Biology and Activities of Yeast (Skinner, F.A., Passmore. S.M., and Davenport, R.R., eds, Soc. 
App. Bacterid. Symposium Series No. 9, 1980). 

15 In an even more preferred embodiment, the yeast host cell is a Candida, Hansenula, 

Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell. 

In a most preferred embodiment, the yeast host cell is a Saccharomyces carlsbergen- 
sis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Sac- 
charomyces ktuyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. In another 

20 most preferred embodiment, the yeast host cell is a Kluyveromyces lactis cell. In another most 
preferred embodiment, the yeast host cell is a Yarrowia fipolytica cell. 

In another more preferred embodiment, the fungal host cell is a filamentous fungal cell. 
"Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota 
(as defined by Hawksworth et a/., 1995, supra). The filamentous fungi are characterized by a 

25 mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex 
polysaccharides. Vegetative growth is by hyphaf elongation and carbon catabolism is obliga- 
tely aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by 
budding of a unicellular thallus and carbon catabolism may be fermentative. 

In an even more preferred embodiment, the filamentous fungal host cell is a cell of a 

30 species of, but not limited to, Acremonium, Aspergillus, Fusarium, Humicola, Mucor, My- 
celiophthora, Neurospora, Peniciflium, Thielavia, Tolypocladium, or Trichoderma. 

In a most preferred embodiment, the filamentous fungal host cell is an Aspergillus 
awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger or 
Aspergillus oryzae cell. In another most preferred embodiment, the filamentous fungal host 

35 cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwetlense, Fusarium cul- 
morum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum t Fusarium ne- 
gundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, 
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Fusarium sarcochrvum, Fusarium sporotrichioides, Fusarium sulphureum, Fusahum torulo- 
sum, Fusarium trichothecioides, or Fusarium venenatum cell. In an even most preferred em- 
bodiment, the filamentous fungal parent cell is a Fusarium venenatum (Nirenberg sp. nov.) 
cell. In another most preferred embodiment, the filamentous fungal host cell is a Humicola in- 
5 solens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa t 
Penicillium purpurogenum, Thielavia terrestris, Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell. 

Fungal cells may be transformed by a process involving protoplast formation, transfor- 
mation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable 

10 procedures for transformation of Aspergillus host cells are described in EP 238 023 and Yelton 
et al. t 1984, Proceedings of the National Academy of Sciences USA 81 : 1470-1474, Suitable 
methods for transforming Fusarium species are described by Malardier et aA, 1989, Gene 78: 
147-156 and WO 96/00787. Yeast may be transformed using the procedures described by 
Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors. Guide to Yeast Genetics and 

15 Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., 
New York; Ito ef a/., 1983, Journal of Bacteriology 153: 163; and Hinnen etal, 1978, Proceed- 
ings of the National Academy of Sciences USA 75: 1920. 

Methods for preparing enzyme polypeptides 

20 The present invention also relates to methods for producing an enzyme of the invention com- 
prising (a) cultivating a strain comprising a nucleotide sequence encoding an enzyme of the 
invention which strain is capable of expressing and secreting the enzyme and (b) recovering 
the enzyme. In a particular embodiment the strain is a wild type strain such as the Alicycloba- 
cillus sp. DSM 15716, while in another embodiment the strain is a recombinant host cell as de- 

25 scribed, supra. 

In these methods of the invention, the cells are cultivated in a nutrient medium suitable 
for production of the enzyme using methods known in the art. For example, the cell may be 
cultivated by shake flask cultivation, small-scale or large-scale fermentation (including continu- 
ous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors per- 

30 formed in a suitable, medium and under conditions allowing the polypeptide to be expressed 
and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon 
and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media 
are available from commercial suppliers or may be prepared according to published composi- 
tions (e.g., in catalogues of the American Type Culture Collection). As the enzyme is secreted 

35 into the nutrient medium, the enzyme can be recovered directly from the medium. 

The resulting enzyme may be recovered by methods known in the art. For example, 
the enzyme may be recovered from the nutrient medium by conventional procedures inciuding. 
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but not limited to, centrifugation, filtration, extraction, spraynirying, evaporation, or precipita- 
tion. 

The polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hy- 
5 drophobic, chromatofocusing, and size exclusion), electrophoretic procedures {e.g., prepara- 
tive isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS- 
PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH 
Publishers, New York, 1989). 

10 Transgenic plants 

The present invention also relates to a transgenic plant, plant part, or plant cell that 
has been transformed with a nucleotide sequence encoding an enzyme of the invention so as 
to express and produce the enzyme. In one embodiment the plant could be used as host for 
production of enzyme in recoverable quantities. The enzyme may be recovered from the plant 

15 or plant part. Alternatively, the plant or plant part containing the recombinant enzyme may be 
used as such for improving the quality of a food or feed, e.g., improving nutritional value, 
palatability, and Theological properties, or to destroy an antinutritive factor. In particular t he 
plant or plant parts expressing the enzyme may be used as an improved starting material for 
production of fuel-alcohols or bio-ethanol 

20 The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a 

monocot). Examples of monocot plants are grasses, such as meadow grass (blue grass, 
Poa), forage grass such as festuca, lolium, temperate grass, such as Agrostis, and cereals, 
e.g., wheat, oats, rye, barley, rice, sorghum, and maize (corn). 

Examples of dicot plants are tobacco, legumes, such as lupins, potato, sugar beet, 

25 pea, bean and soybean, and cruciferous plants (family Brassicaceae), such as cauliflower, 
rape seed, and the closely related model organism Arabidopsis thaliana. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. Also 
specific plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, peroxisomes, and 
cytoplasm are considered to be a plant part. Furthermore, any plant cell, whatever the tissue 

30 origin, is considered to be a plant part. 

Also included within the scope of the present invention are the progeny of such plants, 
plant parts and plant cells. 

The transgenic plant or plant cell expressing an enzyme of the invention may be 
constructed in accordance with methods known in the art. Briefly, the plant or plant cell is 

35 constructed by incorporating one or more expression constructs encoding an enzyme of the 
invention into the plant host genome and propagating the resulting modified plant or plant cell 
into a transgenic plant or plant cell. 
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Conveniently, the expression construct is a nucleic acid construct which comprises a 
nucleotide sequence encoding an enzyme of the present invention operably linked with 
appropriate regulatory sequences required for expression of the nucleotide sequence in the 
plant or plant part of choice. Furthermore, the expression construct may comprise a selectable 
5 marker useful for identifying host cells into which the expression construct has been integrated 
and DNA sequences necessary for introduction of the construct into the plant in question (the 
latter depends on the DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences and 
optionally signal or transit sequences, is determined, for example, on the basis of when, 

10 where, and how the enzyme is desired to be expressed. For instance, the expression of the 
gene encoding an enzyme of the invention may be constitutive or inducible, or may be 
developmental, stage or tissue specific, and the gene product may be targeted to a specific 
tissue or plant part such as seeds or leaves. Regulatory sequences are, for example, 
described by Tague et a/., 1988, Plant Physiology 86: 506. 

15 For constitutive expression, the 35S-CaMV promoter may be used (Franck et aA, 

1 980, Cell 21 : 285-294). Organ-specific promoters may be, for example, a promoter from stor- 
age sink tissues such as seeds, potato tubers, and fruits (Edwards & Coruzzi, 1990, Ann. Rev. 
Genet 24: 275-303). or from metabolic sink tissues such as meristems (Ho ef a/., 1994, Plant 
Mol. Biol. 24: 863-878), a seed specific promoter such as the glutelin, prolamin, globulin, or 

20 albumin promoter from rice (Wu et ai 9 1998, Plant and Cell Physiology 39: 885-889), a Vicia 
faba promoter from the legumin B4 and the unknown seed protein gene from Vicia faba {Con- 
rad et a/., 1998, Journal of Plant Physiology 152: 708-711), a promoter from a seed oil body 
protein (Chen et a/., 1998, Plant and Cell Physiology 39: 935-941), the storage protein napA 
promoter from Brassica napus, or any other seed specific promoter known in the art, e.g., as 

25 described in WO 91/14772. Furthermore, the promoter may be a leaf specific promoter such 
as the rbcs promoter from rice or tomato (Kyozuka et ai, 1993, Plant Physiology 102: 991- 
1000, the chlorella virus adenine methyitransferase gene promoter (Mitra and Higgins, 1994, 
Plant Molecular Biology 26: 85-93), or the aldP gene promoter from rice (Kagaya et a/., 1995, 
Molecular and General Genetics 248: 668-674), or a wound inducible promoter such as the 

30 potato pin2 promoter (Xu et a/., 1 993, Plant Molecular Biology 22: 573-588). 

A promoter enhancer element may also be used to achieve higher expression of the 
enzyme of the invention in the plant. For instance, the promoter enhancer element may be an 
intron which is placed between the promoter and the nucleotide sequence encoding an 
enzyme of the present invention. For instance, Xu ef al., 1993, supra disclose the use of the 

35 first intron of the rice actin 1 gene to enhance expression. 

The selectable marker gene and any other parts of the expression construct may be 
chosen from those available in the art. 
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The nucleic acid construct is incorporated into the plant genome according to 
conventional techniques known in the art, including Agrobacterium-me6\aied transformation, 
virus-mediated transformation, microinjection, particle bombardment, biolistic transformation, 
and electroporation (Gasser et a/., 1990, Science 244: 1293; Potrykus, 1990, Bio/Technology 
5 8: 535; Shimamoto et a/., 1989, Nature 338: 274). 

Presently, Agrobactehum fumefac/ens-mediated gene transfer is the method of choice 
for generating transgenic dicots (for a review, see Hooykas and Schilperoort, 1992, Plant 
Molecular Biology 19: 15-38). However it can also be used for transforming monocots, 
although other transformation methods are generally preferred for these plants. Presently, the 
10 method of choice for generating transgenic monocots is particle bombardment (microscopic 
gold or tungsten particles coated with the transforming DNA) of embryonic calli or developing 
embryos (Christou, 1992, Plant Journal 2: 275-281; Shimamoto, 1994, Current Opinion 
Biotechnology 5: 158-162; Vasil et a/., 1992, Bio/Technology 10: 667-674). An alternative 
method for transformation of monocots is based on protoplast transformation as described by 
1 5 Omirulleh et a/., 1 993, Plant Molecular Biology 21:41 5-428. 

Following transformation, the transformants having incorporated therein the 
expression construct are selected and regenerated into whole plants according to methods 
well known in the art. 

The present invention also relates to methods for producing an enzyme of the invention 
20 comprising (a) cultivating a transgenic plant or a plant cell comprising a nucleotide sequence 
encoding an enzyme of the invention under conditions conducive for production of the enzyme 
and (b) recovering the enzyme. 

Compositions comprising polypeptides and methods for their preparation 

25 The invention provide a composition comprising a polypeptide of the invention and an excipient 
and a method for preparing such a composition comprising admixing the polypeptide of the 
invention with an excipient. In a particular embodiment the polypeptide of the invention is the 
major (polypeptide) component of the composition, e.g., a mono-component composition. The 
excipient in this context is to be understood as any auxilliary agent or compound used to for- 

30 mulate the composition and includes solvent, carriers, stabilizers and the like. 

The composition may further comprise one or more additional enzymes, such as an 
aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, cellulase, chitinase, cu- 
tinase, cycJodextrin glycosyltransferase, deoxyribonuclease, esterase, alpha-galactosidase, 
beta-galactosidase, glucoamylase, alpha-glucosrdase, beta-glucosidase, haloperoxidase, in- 

35 vertase, laccase, lipase, mannosidase, oxidase, pectinolytic enzyme, peptidoglutaminase, per- 
oxidase, phytase, polyphenoloxidase, proteolytic enzyme, ribonuclease, transglutaminase, or 
xylanase. 



10406.010-DK 



The compositions may be prepared in accordance with methods known in the art and 
may be in the form of a liquid or a solid composition. For instance, the enzyme composition 
may be formulated using methods known to the art of formulating polypeptides and/or 
pharmaceutical products, e.g. into coated or uncoated granules or micro-granules. The 
5 polypeptide of the invention may thus be provided in the form of a granule, preferably a non- 
dusting granule, a liquid, in particular a stabilized liquid, a slurry or a protected polypeptide. For 
certain applications, immobilization of the polypeptide on a solid matrix may be preferred. 

The polypeptide to be included in the composition may be stabilized in accordance with 
methods known in the art e.g. by stabilizing the polypeptide in the composition by adding and 
10 antioxidant or reducing agent to limit oxidation of the polypeptide or it may be stabilized by 
adding polymers such as PVP, PVA, PEG or other suitable polymers known to be beneficial to 
the stability of polypeptides in solid or liquid compositions 

In a further embodiment the composition of the invention is a detergent composition 
which, in addition to the polypeptide of the invention, comprises a surfactant and optionally 
15 compounds selected from the group consisting of builders such as zeolites, bleaching agents 
such as percarbonate, bleach enhancers such as TAED or NOBS, suds suppressors, 
frag rants, etc. 

In a further embodiment the composition of the invention is a feed composition that in 
addition to the polypeptide of the invention comprises a cereal or grain product. 
20 In a further embodiment the composition of the invention is a food composition such as 

a bakers flour composition, a brewed product, a fruit juice, an oil or lard product comprising the 
polypeptide of the invention. 

In a further embodiment the composition of the invention is a pulping composition, 
which in addition to the polypeptide of the invention, comprises pulp. 
25 In a further embodiment the composition of the invention is a biocidal composition, 

which comprises in addition to the polypeptide of the invention, an oxidoreductase enhancer. 

Use of polypeptides or compositions comprising them 

In still further aspects the invention provides use of the polypeptides or polynucleotides 
30 of the invention or a composition comprising said polypeptides or polynucleotides in various 
applications, particularly (technical) processes such as processes performed in industry or 
household, herein under for commercial research purposes. Hence the invention encom- 
passes a process comprising employing a polypeptide of the invention or a polynucleotide of 
the invention in a (technical) industrial, research or household process. 
35 In one embodiment the polypeptide or the composition of the invention is used for 

cleaning a cellulosic fabric. 
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In another embodiment the polypeptide or the composition of the invention is used to 
prepare a food or feed additive. 

In yet another embodiment the polypeptide or the composition of the invention is used 
for treatment of lignolosic materials and pulp. 

5 DETERGENT DISCLOSURE 

The polypeptide of the invention may be added to and thus become a component of a 
detergent composition. 

The detergent composition of the invention may for example be formulated as a hand 
or machine laundry detergent composition including a laundry additive composition suitable for 
10 pre-treatment of stained fabrics and a rinse added fabric softener composition, or be formu- 
lated as a detergent composition for use in general household hard surface cleaning opera- 
tions, or be formulated for hand or machine dishwashing operations. 

In a specific aspect, the invention provides a detergent additive comprising the poly- 
peptide of the invention. The detergent additive as well as the detergent composition may com- 
15 prise one or more other enzymes such as a protease, a lipase, a cutinase, an a mylase, a 
carbohydrase, a cellulase, a pectinase, a mannanase, an arabinase, a galactanase, a xy- 
ianase, an oxidase, e.g., a laccase, and/or a peroxidase. 

In general the properties of the chosen enzyme(s) should be compatible with the se- 
lected detergent, (i.e. pH-optimum, compatibility with other enzymatic and non-enzymatic in- 
20 gradients, etc.), and the enzyme(s) should be present in effective amounts. 

Proteases: Suitable proteases include those of animal, vegetable or microbial origin. 
Microbial origin is preferred. Chemically modified or protein engineered mutants are included. 
The protease may be a serine protease or a metallo protease, preferably an alkaline microbial 
protease or a trypsin-like protease. Examples of alkaline proteases are subtilisins, especially 
25 those derived from Bacillus, e.g., subtilisin Novo, subtilisin Carlsberg, subtilisin 309, subtilisin 
147 and s ubtilis'm 1 68 (described i n WO 89/06279). Examples of trypsin-like proteases a re 
trypsin (e.g. of porcine or bovine origin) and the Fusarium protease described in WO 89/06270 
and WO 94/25583. 

Examples of useful proteases are the variants described in WO 92/19729, WO 
30 98/20115, WO 98/20116, and WO 98/34946, especially the variants with substitutions in one 
or more of the following positions: 27, 36, 57, 76, 87, 97, 101, 104, 120, 123, 167, 170, 194, 
206, 218, 222, 224, 235 and 274. 

Preferred commercially available protease enzymes include Alcalase®, Savinase®, 
Primase®, Duralase®, Esperase®, and Kannase® (Novozymes A/S), Maxatase®, Maxacal®, 
35 Maxapem®, Properase®, Purafect®, Purafect OxP®, FN2®, and FN3® (Genencor Interna- 
tional Inc.). 
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Lipases: Suitable lipases include those of bacterial or fungal origin. Chemically modi- 
fied or protein engineered mutants are included. Examples of useful lipases include lipases 
from Humicola (synonym Thermomyces), e.g. from H. lanuginosa (T. lanuginosus) as de- 
scribed in EP 258 068 and EP 305 216 or from H. insolens as described in WO 96/13580, a 
5 Pseudomonas lipase, e.g. from P. alcaligenes or P. pseudoalcaligenes (EP 218 272), P. ce- 
pacia (EP 331 376), P. stutzeri (GB 1,372,034), P. fluorescens, Pseudomonas sp. strain SD 
705 (WO 95/06720 and WO 96/27002), P. wisconsinensis (WO 96/12012), a Bacillus lipase, 
e.g. from B. subtilis (Dartois et al. (1993), Biochemica et Biophysica Acta, 1131, 253-360). B. 
stearothermophilus (JP 64/744992) or B. pumilus (WO 91/16422). 

10 Other examples are lipase variants such as those described in WO 92/05249, WO 

94/01541, EP 407 225, EP 260 105, WO 95/35381, WO 96/00292, WO 95/30744, WO 
94/25578, WO 95/14783, WO 95/22615, WO 97/04079 and WO 97/07202. 

Preferred commercially available lipase enzymes include LipolaseTM, Lipolase Ul- 
traTM and Lipex (Novozymes A/S). 

15 Amylases: Suitable amylases (alpha and/or beta) include those of bacterial or fungal 

origin. Chemically modified or protein engineered mutants are included. Amylases include, for 
example, alpha-amylases obtained from Bacillus, e.g. a special strain of B. licheniformis, de- 
scribed in more detail in GB 1,296,839. 

Examples of useful amylases are the variants described in WO 94/02597, WO 

20 94/18314, WO 96/23873, and WO 97/43424, especially the variants with substitutions in one 
or more of the following positions: 15, 23. 105, 106, 124, 128, 133, 154, 156, 181, 188, 190, 
197, 202, 208, 209, 243, 264, 304. 305, 391, 408, and 444. 

Commercially available amylases are Duramyl™, Termamyl™, Fungamyl™ and 
BAN™ (Novozymes A/S), Rapidase™ and Purastar™ (from Genencor International Inc.). 

25 Cellulases: Suitable cellulases include those of bacterial or fungal origin. Chemically 

modified or protein engineered mutants are included. Suitable cellulases include cellulases 
from the genera Bacillus, Pseudomonas. Humicola, Fusarium, Thielavia, Acremonium, e.g. the 
fungal cellulases produced from Humicola insolens, Myceliophthora thermophila and Fusarium 
oxysporum disclosed in US 4,435,307, US 5,648.263, US 5,691,178, US 5,776,757 and WO 

30 89/09259. 

Especially suitable cellulases are the alkaline or neutral cellulases having colour 
care benefits. Examples of such cellulases are cellulases described in EP 0 495 257, EP 0 531 
372, WO 96/11262, WO 96/29397. WO 98/08940. Other examples are cellulase variants such 
as those described in WO 94/07998, EP 0 531 315, US 5,457.046, US 5,686,593, US 
35 5.763,254, WO 95/24471 , WO 98/12307 and PCT/DK9 8/00299. 
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Commercially available cellulases include Celluzyme®, and Carezyme® (No- 
vozymes), Clazinase®, and Puradax HA® (Genencor International Inc.), and KAC-500(B) ® 
(Kao Corporation). 

Peroxidases/Oxidases: Suitable peroxidases/oxidases include those of plant, bacterial 
5 or fungal origin. Chemically modified or protein engineered mutants are included. Examples of 
useful peroxidases include peroxidases from Coprinus, e.g. from C. cinereus, and variants 
thereof as those described in WO 93/24618, WO 95/10602, and WO 98/15257. 

Commercially available peroxidases include Guardzyme® (Novozymes A/S). 
The detergent enzyme(s) may be included in a detergent composition by adding 
10 separate additives containing one or more enzymes, or by adding a combined additive com- 
prising all of these enzymes. A detergent additive of the invention, i.e. a separate additive or a 
combined additive, can be formulated e.g. as a granulate, a liquid, a slurry, etc. Preferred de- 
tergent additive formulations are granulates, in particular non-dusting granulates, liquids, in 
particular stabilized liquids, or slurries. 
15 Non-dusting granulates may be produced, e.g., as disclosed in US 4,106,991 

and 4,661,452 and may optionally be coated by methods known in the art. Examples of waxy 
coating materials are poly(ethylene oxide) products (polyethyleneglycol, PEG) with mean mo- 
lar weights of 1000 to 20000; ethoxylated nonylphenols having from 16 to 50 ethylene oxide 
units; ethoxylated fatty alcohols in which the alcohol contains from 12 to 20 carbon atoms and 
20 in which there are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; and mono- and di- 
and triglycerides of fatty acids. Examples of film-forming coating materials suitable for applica- 
tion by fluid bed techniques are given in GB 1483591. Liquid enzyme preparations may, for 
instance, be stabilized by adding a polyol such as propylene glycol, a sugar or sugar alcohol, 
lactic acid or boric acid according to established methods. Protected enzymes may be pre- 
25 pared according to the method disclosed in EP 238,216. 

The detergent composition of the invention may be in any convenient form, e.g., 
a bar, a tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aqueous, 
typically containing up to 70 % water and 0-30 % organic solvent, or non-aqueous. 

The detergent composition comprises one or more surfactants, which may be 
30 non-ionic including semi-polar and/or anionic and/or cationic and/or zwitterionic. The surfac- 
tants are typically present at a level of from 0.1 % to 60% by weight. 

When included therein the detergent will usually contain from about 1% to about 
40% of an anionic surfactant such as linear alkylbenzenesulfonate, alpha-olefinsulfonate, alkyl 
sulfate (fatty a Icohol s ulfate), alcohol e thoxysulfate, s econdary a Ikanesulfonate, a Ipha-sulfo 
35 fatty acid methyl ester, alkyl- or alkenylsuccinic acid or soap. 

When included therein the detergent will usually contain from about 0.2% to 

about 40% of a non-ionic surfactant such as alcohol ethoxylate, nonylphenol ethoxylate, alkyl- 
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polyglycoside, alky Idime thy lamineoxide, ethoxylated fatty acid monoethanolamide, fatty acid 
monoethanolamide, polyhydroxy alkyl fatty acid amide, or N-acyl N-alkyl derivatives of gluco- 
samine ("glucamides"). 

The detergent may contain 0-65 % of a detergent builder or complexing agent 
5 such as zeolite, diphosphate, triphosphate, phosphonate, carbonate, citrate, nitrilotriacetic 
acid, ethylenediaminetetraacetic acid, diethylenetriaminepentaacetic acid, alkyl- or alkenyisuc- 
cinic acid, soluble silicates or layered silicates (e.g. SKS-6 from Hoechst). 

The detergent may comprise one or more polymers. Examples are carboxy- 
methylcellulose, poly(vinylpyrrolidone), poly (ethylene glycol), polyvinyl alcohol), 
10 poly(vinylpyridine-N-oxide), poly(vinylimidazole), polycarboxylates such as polyacrylates, 
maleic/acrylic acid copolymers and lauryl methacrylate/acrylic acid copolymers. 

The detergent may contain a bleaching system which may comprise a H202 
source such as perborate or percarbonate which may be combined with a peracid-forming 
bleach activator such as tetraacetylethylenediamine or nonanoyloxybenzenesulfonate. Alterna- 
15 tively, the bleaching system may comprise peroxyacids of e.g. the amide, imide, or sulfone 
type. 

The enzyme(s) of the detergent composition of the invention may be stabilized 
using conventional stabilizing agents, e.g., a polyol such as propylene glycol or glycerol, a 
sugar or sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g., an aromatic bo- 

20 rate ester, or a phenyl boronic acid derivative such as 4-formylphenyl boronic acid, and the 
composition may be formulated as described in e.g. WO 92/19709 and WO 92/19708. 

The detergent may also contain other conventional detergent ingredients such 
as e.g. fabric conditioners including clays, foam boosters, suds suppressors, anti-corrosion 
agents, soil-suspending agents, anti-soil redeposition agents, dyes, bactericides, optical bright- 

25 eners, hydrotropes, tarnish inhibitors, or perfumes. 

It is at present contemplated that in the detergent compositions any enzyme, in 
particular the enzyme of the invention, may be added in an amount corresponding to 0.01-100 
mg of enzyme protein per litre of wash liquor, preferably 0.05-5 mg of enzyme protein per liter 
of wash liquor, in particular 0.1-1 mg of enzyme protein per litre of wash liquor. 

30 The enzyme of the invention may additionally be incorporated in the detergent 

formulations disclosed in WO 97/07202 that is hereby incorporated as reference. 

DEPOSITED MICROORGANISMS 

The following microorganism were deposited by the applicant according to the Budapest 
Treaty on the International Recognition of the Deposits of Microorganisms for the Purpose of 
35 Patent Procedures at Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, 
Mascheroder Weg 1b, D-38124 Braunschweig, Germany: 
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June 30, 2003: Alicyclobacillus sp. CS81 thermo-acidophile; DSM accession No. 15716 
EXAMPLES 

5 Example 1 Identifying functional polypeptides secreted by Alicyclobacillus sp. DSM 
15716 

A. Genomic library construction 

Chromosomal DNA from Alicyclobacillus sp. DSM 15716 was prepared by using standard mo- 
10 lecutar biology techniques (Ausuble et aL 1995 "Current protocols in molecular biology" Publ: 
John Wiley and sons). The prepared DNA was partially cleaved with Sau3A and separated on 
an agarose gel. Fragments of 3 to 8 kilobases were eluted and precipitated and resuspended 
in a suitable buffer. 

A genomic library was made by using the Stratagene ZAP Express ™ predigested Vector kit 
15 and Stratagene ZAP Express ™predigested Gigapack ® cloning kit (Bam HI predigested) 
(Stratagene Inc., USA) following the instructions/recommendations from the vendor. The re- 
sulting lambdaZAP library comprised 38000 pfu of which 10000 were collected for mass exci- 
sion. The resulting 70000 E. coli colonies were pooled and plasmids were prepared by using 
the Qiagen Spin Mini prep kit (Qiagen, Germany). The eluate of approx. 1 ml containing the 
20 plasmid DNA was precipitated in a centrifuge with 1 volume part of Na-acetate pH5 and 2 vol- 
ume parts 96% ethanol at 20000 rpm at 4C, washed with 70% v/v ethanol, dried at room tem- 
perature and resuspended in 200 micro! TE buffer. The DNA concentration of the plasmid pool 
DNA of the Alicyclobacillus sp. genomic library was 5.2 microgram/microliter. 

25 B. Transposon construction and preparation 

Construction of a SigA4 transposon containing the (3-lactamase reporter gene. 

Following the instructions of WO 01/77315 A1, the construction of a transposon containing a 
signal-less 0-lactamase gene was carried o ut u sing standard molecular biology techniques. 
The signal-less p-lactamase gene was initially PCR amplified from the vector pUC19) using a 

30 proofreading polymerase (Pfu Turbo, Stratagene, USA). The resulting PCR fragment con- 
tained the restriction sites Not] and EcoRl in order to aid cloning. The plasmid pEntrancepo- 
son(Cam r ) containing the Entranceposon and the antibiotic resistance markers CAT (encoding 
chloramphencol resistance in the transposon) was obtained from Finnzymes, OY (Espoo 
Finland). The plasmid was digested with the restriction enzymes Not\ and EcoRl, gel purified 

35 and ligated with the signal-less p-lactamase containing fragment. The ligation was transformed 
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into electro-competent DH10B cells and the E.coli clone containing the recombinant plasmid 
with the signal-less p-lactamase was identified by restriction analysis and named SigA2. 

For transposon preparation, a smaller derivative of SigA2 was constructed, which 
lacked the bid gene encoding beta-lactamase: Two oligonucleotide primers SigA2NotU-P 5 - 
5 TCG CGA TCC GTT TTC GCA TTT ATC GTG AAA CGC T-3* (SEQ ID NO: 51) and 
SigA2NotD-P 5'-CCG CAA ACG CTG GTG AAA GTA AAA GAT GCT GAA-3' (SEQ ID NO: 
52), which bind to the start and stop of the bla gene of SigA2 directing outwards were used 
PCR amplify SigA2 without the bla gene. An amplificate of approx. 3,6 kb generated in the this 
PGR reaction was relegated and transformed in to a suitable E.coli strain. A plasmid of 3,6 kb 
10 was isolated from a transformant which was able to grow on LB chloramphenicol but not on LB 
arnpicillin. This plasmid maintained both Bglll sites and lacks the active bla gene and was 
called pSig4. 




-j g signal less Beta lactamase 

60 microliter of pSigA4 plasmid DNA preparation with a concentration of 0.3 micro- 
gram/microliter was digested with Bglll and separated on an agarose gel. The SigA2 transpo- 
son DNA band of 2 kb was eluted and purified by using the ,r GFX™PCR, DNA and Gel Band 
20 Purification Kit" (Amersham Pharmacia Biotech Inc. USA) according to the instructions of the 
vender and eluted in 200 microliter EB buffer. 

C. Transposon tagging 

The transposon prepared from pSigA4 carries a 5'-truncated bla-gene encoding a (3- 

25 lactamase from which the secretion signal has been removed. The p-lactamase conveys 

arnpicillin resistance on E. coli only when the protein is secreted to the periplasm, whereas 

cytoplasmic expression of p-Iactamase does not confer arnpicillin resistance. Without a signal 
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sequence, the 0-lactamase enzyme will not be transported to the periplasm and therefore the 
clone will not grow on media containing ampicillin. The signal-less (3-lactamase gene was 
contained within the transposon in such a way that there was a continuous open reading frame 
between the transposon border and the p-lactamase coding region. In this way the modified 
5 transposon, when it transposes into a gene encoding a protein that is secreted, could cause an 
in-frame fusion with the target gene. This resulted in a fusion gene product that is secreted to 
the periplasm of E. coti and conveys resistance to the ampicillin. If the transposon integrated 
even in-frame into a gene encoding a non-secreted protein, the respective host will not 
become ampicillin resistance. 

10 For the in vitro transposon tagging of the Alicyclobacillus sp. library, 4 or 8 microliter of 

SigA2 transposon containing approx. 2,6 ug DNA were mixed with 1 microliter of the DNA con- 
centration of the plasmid pool DNA of the Alicyclobacillus sp. genomic library, 2 microliter of 
Finnzymes MuA Transposase (0,22 microgram/microliter) and 5 microliter of 5x buffer from 
Finnzymes OY. Espoo, Finland) in a total volume of 50 microliter and incubated at 30 °C for 

15 3,5 h and followed by heat inactivation at 75 °C for 10 min. The DNA was precipitated by addi- 
tion of 5microIiter 3M Na-acetate pH5 and 110 microliter 96% ethanol and centrifugation for 30 
min at 20000 rpm. The pellet was washed and dried and resuspended in 10 microliter TE 
buffer. 

20 D. Transformation and selection 

Electro-competent E. coli DH10B cells were transformed by electroporation in a Biorad Gene 
Pulse device (50uF, 25mAmp, 1.8 kV with 5 microliter of the transposon tagged plasmid pool, 
mixed with 1ml SOC medium, pre-incubated for 1h at 37C and plated on LB with 25 microli- 
ter/mililiter ampicillin, 50 microliter/mililiter kanamycin, 10 microliter/mililiter chloramphenicol 
25 and incubated for 2-3 days. Out of the resistant transformants 1056 colonies were selected 
and plasmids were prepared by applying the Qiaprep 96 Turbo Biorobot kit according to the 
instructions of the vender. 

E. Plasmid preparation and sequencing 

30 1056 transposon tagged plasmids were sequenced in with the A2up primer 
AGCGTTTGCGGCCGCGATCC (SEQ ID NO: 53) which read upstream into the into the trans- 
poson tagged gene, and, in a second reaction, with B primer TTATTCGGTCGAAAAGGATCC 
(SEQ ID NO: 54) which read downstream into the transposon tagged gene. 

35 F. Sequence assembly and annotation 
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The obtained sequences were assembled into contigs by using the program PhredPhrap 
(Brent Ewing, LaDeana Hillier, Michael C. Wendl, and Phil Green, Base-calling of automated 
sequencer traces using phred l. Accuracy assessment (1998) Genome Research 8:175-185; 
Brent Ewing and Phil Green, Base-calling of automated sequencer traces using phred II. Error 
5 probabilities (1998) Genome Research 8:186-194). The obtained contigs were subsequently 
compared to sequences available in standard public DNA and protein sequences databases 
by using the program BLASTX 2.0a19MP-WashU [14-Jul-1998] [Build Iinux-x86 18:51:44 30- 
Jul-1998] (Gish, Warren (1994-1997). Unpublished; Gish. Warren and David J. States (1993). 
Identification of protein coding regions by database similarity search. Nat. Genet 3:266-72). 
10 The obtained sequences were functional genes which encoded intact and functional 

polypeptides, because they were obtained as ampicillin resistant clones as explained supra. 

Example 2. Determining function by homology 

The function of the polypeptides SEQ ID NO: 26 to SEQ ID NO: 50 were annotated by se- 
15 quences comparison with genes or polypeptides of known function. The polypeptides of the 
invention were compared to a list of closest related sequences from public and inhouse data- 
bases of contig's. The contigs, from which SEQ ID NO: 26 to SEQ ID NO: 50 were derived, 
were subsequently compared to sequences available in standard public DNA and protein se- 
quences databases by using the program BLASTX 2.0a19MP-WashU [14- Jul- 1998]. A careful 
20 analysis of sequence alignments of SEQ ID NO: 26 to SEQ ID NO: 40 to their closest related 
sequences with known function from other databases made it possible to predict the function 
of these polypeptides on the basis of the degree of amino acid identity. Even when the overall 
amino acid identity was below 40%, which usually makes it difficult to make a good prediction, 
we were able to predict the function of SEQ ID NO: 26 to SEQ ID NO: 40 by carefully analys- 
25 ing and interpreting the amino acid residues in the catalytic sites or in important regions of the 
polypeptide sequences. If the amino acids of the catalytic site of a known sequences were also 
present in the polypeptide of the invention, combined with a sufficient overall amino acid iden- 
tity, it was concluded that the polypeptide from Alicyclobacillus sp DSM 15716 had the same 
function as the known sequence. 

30 

Example 3 Preparing polypeptides of SEQ ID NO: 26 to SEQ ID NO: 50 

To prepare the polypeptides of SEQ ID NO: 26 to SEQ ID NO: 50, the genes comprised in 

SEQ ID NO: 1 to SEQ ID NO: 25 encoding these polypeptides are expressed by fusing the 

DNA encoding the open reading frame to DNA a promoter, ribosome- binding site and termina- 

35 tor suitable for genes expression in an appropriate host strain, for example Escherichia co//, 

Bacillus subtilis, Bacillus ficheniformis or Bacillus clausii or a derivative of Alicyclobacillus sp. 

The promoter can either be an inducible promotor or a constitutive promoter. Any signal se- 
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quences of SEQ ID NO: 26 to SEQ ID NO: 50 can be exchanged with a suitable signal peptide 
of another bacterium. The expression construct can either be part of a plasmid or of a linear 
DNA. It can be integrated into the chromosome of the host strain by recombination or it can be 
present in the host cell on a plasmid. Then the transformed cells carrying the gene of interest 
5 are grown in a suitable medium in the desired volume. If an inducible promoter is used r the 
gene expression is started by adding the inducer. Otherwise a no inducer is needed and the 
cells will be grown until a suitable amount of protein from the gene of interest is produced. 
Then the culture is harvested and the proteins are recovered by standard methods. 

10 Example 4. Determining serine-carboxyl protease activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting a serine-carboxy 
protease in a suitable buffer may be assayed for that activity. A suitable volume of such a 
sample is spotted on agarose plates which contain the insoluble chromogenic substrate AZCL- 
collagen (Megazyme ™) or Azocoll (Sigma-A)drich) and a suitable buffer at acidic pH, e.g. pH 

15 is 3-5. The plate is incubated for an appropriate time, e.g. one day at ah appropriate tempera- 
ture, e.g. 55CIC. The activity is visible as blue halos around the spots. As an alternative to 
AZCL-collagen or Azocoll, non-labelled collagen is added to agar plates, on which enzyme ac- 
tivity can be detected as clearing zones. By addition of pepstatin, the protease activity of a ser- 
ine carboxyl protease cannot be i nhibited. As a n a Itemative, the activity d etermination of a 

20 sample containing a serine-carboxy! protease can be measured as described in Tsuruoka N, 
Nakayama T, Ashida M, Hemmi H, Nakao M, Minakata H, Oyama H t Oda K, Nishino T; "Colla- 
genolytic serine-carboxyl proteinase from Alicyclobacillus sendaiensis strain NTAP-1: purifica- 
tion, characterization, gene cloning, and heterologous expression." Appl Environ Microbiol. Vol. 
69(1); pp 162-169; 2003 Jan. 

25 

Example 5. Determining multi-copper oxidase activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting a multi-copper oxi- 
dase in a suitable buffer may be assayed for that activity as described in Schneider et al., En- 
zyme and Microbial Technology 25, (1999) p. 502-508). 

30 For example a suitable volume, which can be 15 microliter, of such a sample is spotted 

on agarose plates which contain ABTS (2,2-Azinobis 3-Ethylbenzthiazolin-6-sulfonic acid) at a 
suitable concentration, e.g. 1mM, in a suitable puffer, e.g. 0,1 M sodium acetat buffer for pH 
5,5. The plate is incubated for an appropriate time e.g. 16 hours, at an appropriate tempera- 
ture, e.g. 55 °C. The activity is visible as a green zone around the sample. The assay works 

35 on supernatants and extracts. 
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Example 6. Determining serine protease activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting a serine protease in 
a suitable buffer may be assayed for that activity. A suitable volume of such a sample is spot- 
ted on agarose plates which contain the insoluble chromogenic substrate AZCL-casein 
5 (Megazyme ™) or AZCL-collagen (Megazyme ™) and a suitable buffer at suitable pH. The 
plate is incubated for an appropriate time, e.g. one day, at an appropriate temperature, e.g. 
55DC. The activity is visible as blue halos around the spots. As an alternative to AZCL-casein 
and AZCL-collagen (Megazyme ™) non-labelled casein or non-labelled collagene can be 
used. On non-labelled collagen or non-labelled casein spotted on agarose plates, clearing 
10 zones form in the presence of a serine protease. 

Example 7. Determining aspartyl protease activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting an aspartyl prote- 
ase in a suitable buffer was assayed for that activity. A suitable volume of such a sample can 

15 be spotted on agarose plates, which contain the insoluble chromogenic substrate AZCL- 
collagen (Megazyme ™> and a suitable buffer at acidic pH, e.g. pH is 3-5. The plate can be 
incubated for an appropriate time, e.g. one day, at an appropriate temperature, e.g. 55DC. The 
activity is visible a s blue halos a round the s pots. As an alternative to AZCL-collagen, n on- 
labelled collagen can be used. On non-labelled collagen spotted on agarose plates, clearing 

20 zones form in the presence of an aspartyl protease. Upon specifically testing the aspartyl pro- 
tease of ID NO: 27; the activity was determined as a spot test of 20 microliter culture fluid on 
0.1% AZCL-collagen (Megazyme ™) spotted on LB-PG agar plates at pH 3.4. The plates were 
incubated at 55 °C {over night) and the presence of the aspartyl protease was visible as blue 
halos around the spots. 

25 The aspartyl protease comprised in SEQ ID NO: 27 showed significant sequence similarity to 
aspartyl peptidases belonging to family A4. This family contains peptidase sequences, which 
have an aspartate and a glutamate residue in their active site. Both residues were conserved 
in the aspartyl protease comprised in SEQ ID NO: 2 7. The aspartyl protease comprised in 
SEQ ID NO: 27 is thus the first bacterial polypeptide showing significant sequence similarity 

30 especially at the active sites with sequences from peptidase family A4 and therefore also the 
first bacterial A4 protease. 

SEQ ID NO: 27 was compared to a reference sequence of family A4 peptidases; Aspergillus 
niger aspergillopepsin II (SEQ ID NO: 55; Swissprot P24665; Takahashi,K.; Inoue.H.; Sakai,K.; 
KohamaX; Kitahara.S,; Takishima.K.; Tanji.M.; Athauda,S.B.P.; Takahashi.T.; Akanuma,H.; 
35 Mamiya.G.; Yamasaki, M); The primary structure of Aspergillus niger acid proteinase A .; J. 
Biol. Chem.; Vol 266; p. 19480; 1991). This polypeptide contained a signal peptide (aa1-, and 
two propeptides (aa 19-58 and aa 99-109), which are removed after secretion during 
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maturation. During maturation a heavy and a light chain are formed, which are cross-linked by 
disulfide bridges between cysteine residues. (Inoue.H.; Kimura.T.; Makabe.O.; Takahashi p K.; 
The gene and deduced protein sequences of the zymogene of Aspergillus niger acid 
proteinase A; J. Biol. Chem.; vol. 266; p. 19484; 1991). The amino acids similar to the second 
5 propeptide (aa99-109) and the amino acids corresponding to the cross-linking cysteine 
residues of SEQ ID NO. 55 are missing in SEQ ID 27 (see alignment). Only a fungal A4 
peptidase has previously been described to to lack cysteine residues {Maita.T; Nagata,S.; 
Matsuda,G.; Maruta.S.; Oda.K.; Murao.S,; Tsuru.D.; Complete amino acid sequence of 
Scytalidium lignicolum acid protease B; J. Biochem.; vol. 95; p. 465; 1984). 

10 

Alignment of SEQ ID NO: 55 with SEQ ID NO: 27 



SWISSPROT_P24665 MKFSTI LTGS - LFATAAIiAAPLTEKRRARKEARAAGKRHSNPPYIPGSDKEILKLNGTTN 
Seq ID NO. 27 MNGTSVWKASG I AAASCLTAAALLAWPHATSTLDAS PAI FHAPRHALS PNTS PKPNS VQA 

15 

□□aaaannnao : 

SWISSPROT_P2 4 665 EEY SSNWAGAVLIGDGYTKVTGEFTVPSVSAGSSGSSGYGGGYGYWKNKRQSEEYCA 

Seq ID No. 27 QNFGWSASNWSGYAVTGSTYNDITGSWIVPAVSP SKR--STYS- 

20 : 

SWISSPROT_P24665 S AWVGI DGDTCETAI LQTGVDFC YEDGQTS YDAWYEWYPDYAYDFS DI TI S EGDS I KVTV 
Seq ID NO. 27 S SWI GI DG - FNNSDLIQTGTEQDYVNGHAQYDAWWEILPAPETVISNMTIAPGDRMSAHI 

* 

25 SWISSPROT_P24665 EATSKSSGSATVENLTTGOSVTHTFSGNVEGDLCETNAEWIVEDFESGDSLVAFADFGSV 
Seq ID No. 27 HNNGNGTWTITLTDVTRNETFSTTQSYSGPG SSAEWIQEAPEIGGRIATLANYGET 

SWISSPROT_P24 665 T FTNAEATS G - -GSTVGPSDAT 

30 Seq ID No. 27 TFDPGTVNGGNPGFTLVPTRATWCRTTRSCLCRPHPTRIPTASTWPTAPTSRAHRPPDPR 

SWISS PROT_P24 665 VMDI EQDGSVLTETSVSGDSVTVTYV 

Seq ID No. 2 7 RSRRPCMEAQGPASFFARTLAPSRDVAAHAPQGHRPSALVRRA 

35 

* = amino acids forming the active site in Swissprot P24665 

: = cysteine residues framing disulfide bonds in Swissprot P24665 

d = propeptide removed from the Swissprot P24665 zymogene. 



40 Example 8. Determining acid beta-glucanase activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting an beta-glucanase 
in a suitable buffer may be assayed for that activity. A suitable volume of such a sample is 
spotted on agarose plates which contain the insoluble chromogenic substrate AZCL-beta- 
glucan (Megazyme ™) and a suitable buffer at acidic pH. e.g. pH is 3-5. The plate is incu- 
45 bated for an appropriate time, e.g. one day, at an appropriate temperature, e.g. 550C. The ac- 
tivity is visible as blue halos around the spots. 
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Example 9. Determining acid phosphatase activity 

A suitable volume of the culture fluid or a cell lysate of a host strain synthesising and secreting 
the acid phosphatase in a suitable buffer at a suitable pH at an appropriate temperature, e.g. 
55 °C is incubated with para-nitrophenolphosphate (pNPP) for measuring the enzyme activity. 
5 The products of the enzymatic reaction are p-nitrophenol and inorganic phosphate or Pi. NaOH 
is added to end the phosphatase assay after a suitable reaction time and forms p- 
nitrophenolate. The absorbation of p-nitrophenolate is measured optically at 405 nm. 
As an alternative, a suitable volume of the culture fluid or a cell lysate of a host strain synthe- 
sising and secreting the acid phosphatase in a suitable buffer at a suitable pH at an appropri- 
10 ate temperature, e.g. 55 °C is used for measuring the enzyme activity with the EnzChek™ Acid 
Phosphatase Assay Kit (E-12020) (Molecular Probes Europe BV; PoortGebouw, Rijnsburger- 
weg 10; 2333 AA Leiden, The Netherlands). 

Example 10. Determining polysaccharide deacetylase activity 

15 A suitable volume of the culture fluid or a cell lysate of a host strain synthesising and secreting 
the polysaccharide deacteylase in a suitable buffer at an appropriate temperature, e.g. 55DC is 
used for measuring the activity. Bacterial rnurein, N,/V-diacetylchitobiose (Sigma) or galactose 
pentaacetate (Sigma) or and cellulose acetate (Sigma) can be used as substrate(s) for this en- 
zyme type. The acetate released from the substrate by the enzyme can be measured with an 

20 acetic acid assay kit (Biopharm) adapted for the physical requierments of the enzyme (Kosugi 
A, Murashima K, and Doi RH; Xylanase and Acetyl Xylan Esterase Activities of XynA, a Key 
Subunit of the Clostridium cellulovorans Cetlulosome for Xylan Degradation; Appl. Environm. I 
Microbiol; vol. 68; pp. 6399-6402; 2002) 

25 Example 11. Determining endo-beta-N-acetylglucosaminidase activity 

A suitable volume of the culture fluid or a cell lysate of a host strain synthesising and secreting 
the endo-beta-N-acetylglucosaminidase activity in a suitable buffer, e.g. pH 3-5, at an appro- 
priate temperature, e.g. 55 °C can be used for measuring the activity in accordance with MH 
Rashid, M Mori and J Sekiguchi; Glucosaminidase of Bacillus subtilis: cloning, regulation, pri- 
30 mary structure and biochemical characterization; Microbiology; vol. 141 ; pp. 2391-2404; 1995. 

Example 12. Determining peptidyl proly-isomerase activity 

A suitable volume of the culture fluid or a cell lysate of a host strain synthesising and secreting 
the polysaccharide deacteylase in a suitable buffer at an appropriate temperature, e.g. 55 °C is 
35 used for measuring the activity. The activity can be determined in accordance to Fischer, G., 
Bang, H. and Mech, C; Determination of enzymatic catalysis for the cis-trans-isomerization of 
peptide binding in proline-containing peptides.; Biomed. Biochim. Acta; vol. 43; pp. 1101- 
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1 11 1;1984. This assay may be modified appropriately to suit the specific peptidyl proly- 
isomerase such as that comprised in SEQ ID NO: 36. 

Example 13. Determining acid cellulase activity 

5 The culture fluid or a cell lysate of a host strain synthesising and secreting an acid cellulase in 
a suitable buffer may be assayed for that activity. A suitable volume of such a sample is spot- 
ted on agarose plates which contain the insoluble chromogentc substrate AZCL-HE-cellulose 
(Megazyme ™) and a suitable buffer at acidic pH, e.g. pH is 3-5. The plate is incubated for an 
appropriate time, e.g. one day, at an appropriate temperature, e.g. 55DC. Presence of acid 
10 cellulase is visible as blue halos around the spots. 

Example 14. Determining xylan deacetylase activity 

A suitable volume of the culture fluid or a cell lysate of a host strain synthesising and secreting 
the polysaccharide deacteylase in a suitable buffer at an appropriate temperature, e.g. 55QC 

15 can be used for measuring xylan deacetylase activity. Xylan deacetylase activity is measured 
as acetate release from acetylated xylan, which is prepared from birchwood xylan by the 
method of Johnson et al. 1988 (Johnson, K. G., J. D. Fontana, and C. R. Mackenzie. 1988. 
Measurement of acetylxylan esterase in Streptomyces. Methods Enzymol. 160:551-560). The 
acetate released from acetyl xylan is measured with an acetic acid assay kit (Biopharrn) 

20 adapted for the physical requierments of the enzyme (Kosugi A, Murashima K, and Doi RH; 
Xyfanase and Acetyl Xylan Esterase Activities ofXynA, a Key Subunit of the Clostridium cellu- 
lovorans Ceflutosome for Xylan Degradation, Appl. Environm. I Microbiol.; vol. 68; pp. 6399- 
6402; 2002). 

25 Example 15. Determining phytase activity 

The culture fluid or a cell lysate of a host strain synthesising and secreting a phytase in a suit- 
able buffer may be assayed for phytase activity. A suitable volume of such a sample is diluted 
in 0.1 M sodium acetate and 0.01% Tween-20, pH 5.5 in a suitable buffer, which can be -HCI 
at pH 3.0 to 3.5, sodium acetate at pH 4.0 to 5.5, morpholincethanesulfonic acid (MES) at pH 

30 6.0 to 6.5, and Tris-HCI at pH 7.0 to 9.0, are further diluted in 26-fold into the substrate solution 
(5 mM sodium phytate [Sigma] in 0.1 M sodium acetate, and 0.01% Tween-20 [pH 5.5], and 
preincubated at 37°C) to start the reaction. After 30 min at 37°C, the reaction is stopped by 
adding an equal volume of 10% trichloroacetic acid. Free inorganic phosphate is measured by 
the addition of an equal volume of molybdate reagent containing, in 100 ml, 7.3 g of FeS0 4 , 

35 1.0 g of (NH 4 )6Mo 7 0 2 4 *4H 2 0, and 3.2 ml of H 2 S0 4 . Absorbance was measured at 750 nm 

(Vmax microtiter plate reader; Molecular Devices) (Lassen SF; Breinholt J; O stergaard PR; 

Brugger R; Bischoff A; Wyss M; Fuglsang CC; Expression, gene cloning, and characterization 
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of five novel phytases from four basidiomycete fungi: Peniophora lycii, Agrocybe pediades, a 
Ceriporia $p„ and Trametes pubescens. Appl. Environ. Micr.; 67; pp. 4701-4707; 2001). 

Example 16. Determining phospholipase activity 

5 The culture fluid or a cell lysate of a host strain synthesising and secreting a phospholipase in 
a suitable buffer may be assayed for phospholipase activity. Lecithin is added to suitable vol- 
ume of such a sample. The Lecithin is hydrolyzed under constant pH and temperature, and the 
phospholipase activity is determined as the rate of titrant (0.1N NaOH) consumption during 
neutralization of the liberated fatty acid. The substrate is soy lecithin (L-a-Phosphotidyl- 
10 Choline), and the conditions are pH 8.00, 40.0°C, reaction time 2 min. The unit (LEU) is de- 
fined relative to a standard. 

Example 17: Expression of aspartyl protease gene (SEQ ID NO: 2) in Bacillus subtilis. 

The signal peptide from the protease SAVINASE™ (also known as subtilisin 309 

15 from B. Licheniformis from Novozymes A/S) was fused by PCR in frame to the gene encoding 
the aspartyl protease (SEQ ID NO: 2). The DNA coding for the resulting coding sequence was 
integrated by homologous recombination on the Bacillus subtilis host cell genome. The gene 
construct was expressed under the control of a triple promoter system (as described in WO 
99/43835), consisting of the promoters from Bacillus licheniformis alpha-amylase gene (arnyL), 

20 Bacillus amyloliquefaciens alpha-amylase gene (amyO), and the Bacillus thuringiensis crylllA 
promoter including stabilizing sequence. The gene coding for Chloramphenicol acetyl- 
transferase was used as maker. (Described e.g in Diderichsen et al., A useful cloning vector 
for Bacillus subtilis, Plasmid, 30, p. 312, 1993). 

Chloramphenicol resistant transformants were analyzed by DNA sequencing to 

25 veriify the correct DNA sequence of the construct. One such clone was selected. 

Fermentations of the aspartyl protease (SEQ ID NO: 2) expression clone was 
performed on a rotary shaking table in 500 ml baffled Erlenmeyer flasks each containing 100 
ml PS-1 media supplemented with 6 mg/l chloramphenicol. The clone was fermented for 6 
days at 37 °C and sample was taken at day 3, 4, 5 and 6 and analyzed for proteolytic activity. 

30 The activity was determined (se example 7) as a spot test of 20 microliter culture fluid on 0.1% 
AZCL-collagen (Megazyme ™) LB-PG agar plates at pH 3.4. The plates were incubated at 55 
°C (over night) and the activity was visible as blue halos around the spots. 

Purification 

Culture broth was centrifuged (20000 x g, 20 min) and the supernatant was carefully decanted 
35 from the precipitates. The pH in the supernatant was decreased to pH 4.0 by citrate-buffer 
(100 mM, pH 4.0) and incubated at 70 °C for 15 min. The heat-treated supernatant was 
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centrifuged (20000 x g, 20 min) and again the supernatant was carefully decanted from the 
precipitates. The supernatant was further analyzed by SDS-PAGE. Essentially only one band 
was seen on the coomassie stained SDS-PAGE gel and spot test of the supernatant (20 
microliter supernatant on 0.1% AZCL-collagen (Megazyme ™) LB-PG agar plates, pH 3.4 at 
5 55 °C, ON) confirmed the aspartyl protease activity, visible as blue halos around the spots. 

Example 28: Expression of acid cellulase gene (SEQ IO NO: 1) in Bacillus subtilis. 

The signal peptide from Termamyl™ (Novozymes) was fused by PGR in frame to the 
gene encoding the acid cellulase (SEQ ID NO: 1). The DNA coding for the resulting coding 

10 sequence was integrated by homologous recombination on the Bacillus subtilis host cell 
genome. The gene construct was expressed under the control of a triple promoter system (as 
described in WO 99/43835), consisting of the promoters from Bacillus licheniformis alpha- 
amylase gene (amyL), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), and the 
Bacillus thuringiensis crylllA promoter including stabilizing sequence. The gene coding for 

15 Chloramphenicol acetyl-transferase was used as maker (Described e.g in Diderichsen et al., A 
useful cloning vector for Bacillus subtilis, Plasmid, 30, p. 312, 1993). 

Chloramphenicol resistant transformants were analyzed by DNA sequencing to verify 
the correct DNA sequence of the construct. One such clone was selected. 

Fermentations of the acid cellulase (SEQ ID NO: 1) expression clone was performed 

20 on a rotary shaking table in 500 ml baffled Erfenmeyer flasks each containing 100 ml PS-1 
media supplemented with 6 mg/l chloramphenicol. The clone was fermented for 3 days at 37 
°C and sample was taken at day 1,2 and 3 and analyzed for cellulase activity. The activity was 
determined as a spot test of 20 microliter culture fluid on 0.1% AZCL-HE-cellulase (Megazyme 
™) LB-PG agar plates at pH 3.4. The plates were incubated at 55 °C (over night) and the 

25 activity was visible as blue halos around the spots. 
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CLAIMS 



1. An isolated polypeptide selected from the group consisting of: 

(a) a polypeptide comprising an amino acid sequence which has at least 90% identity with 
5 a sequence of a mature polypeptide comprised in the group of SEQ ID NO: 26 to SEQ 

ID NO:50; 

(b) a polypeptide which is encoded a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 

10 (i) the complementary strand to a nucleotide sequence selected from the group of 

regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide, 
(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from the group of regions of SEQ ID NO: 1 to SEQ ID NO: 25 
encoding a mature polypeptide 



15 



(c) a fragment of a mature polypeptide comprised in SEQ ID NO:26 to SEQ ID NO:50 and 



wherein the polypeptide has a function of the corresponding mature polypeptides comprised in 
SEQ ID NO:26 to SEQ ID NO:50. 

20 

2. The polypeptide of claim 1, wherein the polypeptide is an enzyme having a function se- 
lected from the group consisting of acid endoglucanase, acid cellulase, aspartyl protease, multi 
copper oxidase, serine-carboxyl protease, serine protease, HtrA-like serine protease, disulfide 
isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo-beta-N- 
25 acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase, phytase, phospholipase 
C, polysaccharide deacetylase, xylan deacetylase and sulfite oxidase. 



3. The enzyme of claim 2 selected from the group consisting of: 

(a) an enzyme having an amino acid sequence which has at least 90% identity with an 
30 amino acid sequence selected from mature enzymes comprised in SEQ ID NO: 26 to 

SEQ ID NO:40; 

(b) an enzyme which is encoded by a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 
(i) the complementary strand to a nucleotide sequence selected from the group of 

35 regions of SEQ ID NO: 1 to SEQ ID NO: 15 encoding the mature enzyme, 
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(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from regions of SEQ ID NO: 1 to SEQ ID NO: 15 encoding 
the mature polypeptide 

5 (c) a fragment of the mature enzyme comprised in SEQ ID NO:26 to SEQ ID NO:40, and 

wherein the enzyme has a function of the corresponding mature polypeptides comprised in 
SEQ ID NO:26 to SEQ ID NO:40. 

10 4. The polypeptide of claim 1 wherein the stringency conditions are very high. 

5. The polypeptide of claim 1 , wherein the polynucleotide encoding the polypeptide consists 
of a nucleotide sequence selected from the group of regions of SEQ ID NO: 1 to SEQ ID NO: 
25 encoding a mature polypeptide or a sequence differing there from by virtue of the degener- 

1 5 acy of the genetic code. 

6. The enzyme of claim 2 f wherein the characterized by being an acid endoglucanase or an 
acid cellulase obtainable from the strain of Alicyclobacillus sp. Deposited under DSM acces- 
sion No. 15716. 

20 

7. The acid endoglucanase or acid cellulase of claim 6, comprising or consisting of the mature 
acid endoglucanase or acid cellulase comprised in SEQ ID NO: 26. 

8. The acid endoglucanase or acid cellulase of claim 7, comprising or consisting of the se- 
25 quences from position 25 to 959 of SEQ ID NO: 26. 

9. A bacterial aspartyl protease. 

10. The aspartyl protease of claim 9, characterised by being free of disulphide bridges in the 
30 protease structure. 

11. The aspartyl protease of claim 10, obtainable from the strain of Alicyclobacillus sp. Depos- 
ited under DSM accession No. 15716. 

35 1 2. The aspartyl protease of claim 1 1 , comprising or consisting of the mature aspartyl protease 
comprised in SEQ ID NO: 27. 
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13. The aspartyl protease enzyme of claim 12, comprising or consisting of the sequences from 
position 33 to 272 of SEQ ID NO: 27. 

14. The enzyme of claim 2 t characterized by being a multi copper oxidase obtainable from the 
5 strain of Aficyclobacillus sp. Deposited under DSM accession No. 15716. 

15. The multi copper oxidase of claim 14, comprising or consisting of the mature multi copper 
oxidase comprised in SEQ ID NO: 28 or SEQ ID NO: 35. 

10 16. The multi copper oxidase of claim 15, comprising or consisting of the sequences from posi- 
tion 26 to 315 of SEQ ID NO: 28 or position 50 to 597 of SEQ ID NO: 35. 

17. The enzyme of claim 2, characterized by being a serine-carboxyl protease obtainable from 
the strain of Aficyclobacillus sp. Deposited under DSM accession No. 15716. 

15 

18. The serine-carboxyl protease of claim 17, comprising or consisting of the mature serine- 
carboxyl protease comprised in SEQ ID NO: 29 or SEQ ID NO: 30. 

19. The serine-carboxyl protease of claim 18, comprising or consisting of the sequences from 
20 position 190 to 626 of SEQ ID NO: 29 or position 25 to 533 of SEQ ID NO: 30. 

20. The enzyme of claim 2, characterized by being a serine protease or a HtrA-like serine pro- 
tease obtainable from the strain of AlicyclobaciUus sp. Deposited under DSM accession No. 
15716. 

25 

21. The serine protease or a HtrA-like serine protease of claim 20, comprising or consisting of 
the mature serine protease or a HtrA-like serine protease comprised in SEQ ID NO: 31. 

22. The serine protease or a HtrA-like serine protease of claim 21, comprising or consisting of 
30 the sequences from position 42 to 41 1 of SEQ ID NO: 31 . 

23. The enzyme of claim 2, characterized by being a disulfide isomerase obtainable from the 
strain of AlicyclobaciUus sp. Deposited under DSM accession No. 15716. 

35 24. The disulfide isomerase of claim 23, comprising or consisting of the mature disulfide isom- 
erase comprised in SEQ ID NO: 32. 
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25. The disulfide isomerase of claim 24, comprising or consisting of the sequences from posi- 
tion 42 to 41 1 of SEQ ID NO: 32. 

26. The enzyme of claim 2, characterized by being a gamma-D-glutamyl-L-diamino acid ob- 
5 tainable from the strain of Alicyclobacillus sp. Deposited under DSM accession No. 15716. . 

27. The gamma-D-glutamyl-L-diamino acid of claim 26, comprising or consisting of the mature 
gamma-D-glutamyl-L-diamino acid comprised in SEQ ID NO: 33. 

10 28. The gamma-D-glutamyl-L-diamino acid of claim 27 f comprising or consisting of the se- 
quences from position 30 to 266 of SEQ ID NO: 33. 

29. The enzyme of claim 2, characterized by being an endo-beta-N-acetylglucos-aminidase 
obtainable from the strain of Alicyclobacillus sp. Deposited under DSM accession No. 15716. 

15 

30. The endo-beta-N-acetylglucosaminidase of claim 29 % comprising or consisting of the ma- 
ture endo-beta-N-acetylglucosaminidase comprised in SEQ ID NO: 34. 

31. The endo-beta-N-acetylglucosarninidase of claim 30, comprising or consisting of the se- 
20 quences from position 27 to 768 of SEQ ID NO: 34. 

32. The enzyme of claim 2, characterized by being a peptidyl-prolyl-isomerase obtainable from 
the strain of Alicyclobacillus sp. Deposited under DSM accession No. 15716. 

25 33. The peptidyl-prolyl-isomerase of claim 32 p comprising or consisting of the mature peptidyl- 
prolyl-isomerase comprised in SEQ ID NO: 36. 

34. The peptidyl-prolyl-isomerase of claim 33, comprising or consisting of the sequences from 
position 30 to 246 of SEQ ID NO: 36. 

30 

35. The enzyme of claim 2, characterized by being an acid phosphatase or a phytase or a 
phospholipase C obtainable from the strain of Alicyclobacillus sp. Deposited under DSM ac- 
cession No. 15716. 

35 36. The acid phosphatase or phytase or phospholipase C of claim 35, comprising or consisting 
of the mature acid phosphatase or phytase or phospholipase C comprised in SEQ ID NO: 37. 
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37. The acid phosphatase or phytase or phospholipase C of claim 36, comprising or consisting 
of the sequences from position 28 to 608 of SEQ ID NO: 37. 

38. The enzyme of claim 2, characterized by being a polysaccharide deacetylase or a xylan 
5 deacetylase obtainable from the strain of Alicyclobacillus sp. Deposited under DSM accession 

No. 15716. 

39. The polysaccharide deacetylase or a xylan deacetylase of claim 38, comprising or consist- 
ing of the polysaccharide deacetylase or a xylan deacetylase comprised in SEQ ID NO: 38 or 

10 SEQ ID NO: 39. 

40. The polysaccharide deacetylase or a xylan deacetylase of claim 39, comprising or consist- 
ing of the sequences from position 26 to 251 of SEQ ID NO: 38 or from position 22 to 324 of 
SEQ ID NO: 39. 

15 

41. An isolated enzyme selected from the group consisting of: 

(a) an enzyme comprising an amino acid sequence which has at least 90% identity with 
the amino acid sequence of a mature enzyme selected from the group consisting of 
acid endoglucanase or acid cellulase, aspartyl protease, multi copper oxidase, serine- 

20 carboxyl p rotease, s erine p rotease o r H trA-like serine p rotease, d isulfide i somerase, 

gamma-D-glutamyl-L-diamino acid endopeptidase, endo-beta-N- 

acetylglucosaminidase. peptidyl-prolyl-isomerase, acid phosphatase or phytase or 
phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase 
secreted from t he strain of A HcyclobacHlus sp. Deposited u nder D SM a ccession N o. 

25 15716 

(b) a polypeptide which is encoded by a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of: 

(i) the complementary strand to a nucleotide sequence comprised in the strain of 
30 Alicyclobacillus sp. Deposited under DSM accession No. 15716 encoding a ma- 

ture enzyme selected from the group consisting of acid endoglucanase or acid 
cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl protease, 
serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 
glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, 
35 peptidyl-prolyl-isomerase, acid phosphatase or phytase or phospholipase C, 

polysaccharide deacetylase or xylan deacetylase and sulfite oxidase secreted 
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from that strain; 

(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences comprised in the strain of Alicyclobacillus sp. Deposited under DSM 
5 accession No. 15716 encoding a mature enzyme selected from the group con- 

sisting of acid endoglucanase or acid cellulase, aspartyl protease, multi copper 
oxidase, serine-carboxyl protease, serine protease or HtrA-like serine protease, 
disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or 
10 phytase or phospholipase C, polysaccharide deacetylase or xylan deacetylase 

and sulfite oxidase secreted from that strain 

(c) a fragment of a mature enzyme selected from the group consisting of acid endogluca- 
nase or acid cellulase, aspartyl protease, multi copper oxidase, serine-carboxyl prote- 
15 ase f serine protease or HtrA-like serine protease, disulfide isomerase, gamma-D- 

glutamyl-L-diamino acid endopeptidase, endo-beta-N-acetylglucosaminidase, peptidyl- 
prolyl-isomerase, acid phosphatase or phytase or phospholipase C, polysaccharide 
deacetylase or xylan deacetylase and sulfite oxidase secreted from the strain of Alicy- 
clobacillus sp. Deposited under DSM accession No. 15716 

20 

wherein the enzyme have a function selected from acid endoglucanase or acid cellulase, as- 
partyl protease, multi copper oxidase, serine-carboxyl protease, serine protease or HtrA-like 
serine protease, disulfide isomerase, gamma-D-glutamyl-L-diamino acid endopeptidase, endo- 
beta-N-acetylglucosaminidase, peptidyl-prolyl-isomerase, acid phosphatase or phytase or 
25 phospholipase C, polysaccharide deacetylase or xylan deacetylase and sulfite oxidase. 

42. A polynucleotide having a nucleotide sequence which encodes for the polypeptide defined 
in any of claims 1-41. 

30 43. A nucleic acid construct comprising the nucleotide sequence defined in claim 42 operably 
linked to one or more control sequences that direct the production of the polypeptide in a host 
cell. 

44. A recombinant expression vector comprising the nucleic acid construct of claim 43. 

35 

45. A recombinant host cell comprising the nucleic acid construct of claim 43. 
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46. A method for producing the polypeptide of claims 1-41 comprising: 

(a) cultivating a strain, which in its wild-type form is capable of producing the polypeptide, 
to produce the polypeptide; and 

5 (b) recovering the polypeptide. 

47. A method for producing a polypeptide of claims 1-41 comprising: 

(a) cultivating a recombinant host cell as defined in claim 45 under conditions conducive 
for production of the polypeptide; and 

0 

(b) recovering the polypeptide. 



48. A composition comprising the polypeptide of claim 1-41 . 

15 49. A method for preparing a composition of claim 7 comprising admixing the polypeptide of 
claim 1-41 with an excipient. 

50. A storage medium suitable for use in an electronic device comprising information of the 
amino acid sequence of the polypeptide of claims 1-41 or the n ucleotide sequences of the 

20 polynucleotide of claim 42. 

51. A process comprising employing a polypeptide of claims 1-41 or a polynucleotide of claim 
42 in an industrial or household technical process. 
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ABSTRACT 

Isolated polypeptides are disclosed selected from the group consisting of: 

(a) polypeptides comprising an amino acid sequence which has at least 90% identity with a 
5 sequence of a mature polypeptide comprised in the group of SEQ ID NO: 26 to SEQ ID 

NO:50; 

(b) polypeptides which are encoded a nucleotide sequence which hybridize under high 
stringency conditions with a polynucleotide probe selected from the group consisting of 

10 (i) the complementary strand to a nucleotide sequence selected from the group of 

regions of SEQ ID NO: 1 to SEQ ID NO: 25 encoding a mature polypeptide, 
(ii) the complementary strand to the cDNA sequence contained in a nucleotide se- 
quences selected from the group of regions of SEQ ID NO: 1 to SEQ ID NO: 25 
encoding a mature polypeptide 



15 



(c) fragments of mature polypeptides comprised in SEQ ID NO:26 to SEQ ID NO:50 and 

wherein the polypeptides have a function of the corresponding mature polypeptides comprised 
in SEQ ID NO:26 to SEQ ID NO:50 



20 
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SEQUENCE LISTING 

<110> Novozymes A/s 

<120> POLYPEPTIDES OF ALICYCLOBACILLUS SP. 
<130> NZ 10406 
<160> 55 

<170> Patentin version 3.2 

<210> 1 

<211> 2877 

<212> DNA 

<213> Alicyclobacillus sp. 



Patent- oq 
Varemsrkestyreken 

0 h FEB. 2004 
Modtaget 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mi s cofeature 
CI).. (2877) 
CDS 



misc_feature 
CD ■ ■ (72) 
sig_peptide 



mi s cofeature 
(73).. (2877) 
mat_peptide 



<400> 1 

ttgaagactc gatggtcagg tgcgctggct gtgctcatcg ccctcggaac 
cccgcttggg ccagtgtcca cagcgcggcc acgcacgcaa aggcgcacgt 
gctgcggata tggccgcagc gtccatgtcg gccgagattc agattctgca 
acggcttccg agctgtcgtc cgtccaggcc gcggcacagg ccgccgccaa 
tccacgtggg tgagctggct gtatccgagc gcctcctcgc cgagcgccgc 
acggcgcagg ccctgggcgc gctcctcacc ttggtcacgt atggcgccgt 
ggccaaaaca tcgcacagaa tttgcaaacc cttcagtcga cttcgccgct 
gcggccgtct cgatgttcta tcaaaacttc ttcgtgctcg tcggccaatc 
gtgctttcgg gccaggcaac cacctccacc gccggccacg ccctcgccca 
ctgacgccac agctcgccgc gtacctgcgc caatccggtc tttcgccgga 
cgcgcctacg tgagctttgc ctccgccgtg gattcgcagg gcgcggcgca 
ctgacgcgca tctgcaccaa catcctgggc tttggcgcgc cgacctccac 
accgtcaacg ccgcggcgaa ccttggacag gtgccgacca ccgcgtttgg 
gccgtgtggg acagcggtct caactcccag accgtcatct ccgaggtgca 
cccgccctca tccgctggcc cggaggctcc atctcggacg tgtacaattg 
acgcggaacg acggcggcta cgtgaatccc gacgacacgt ttgatcactt 
gtgaatgccg tcggctccac gcctatcatc acggtcaact acggcaccgg 

Page 1 



gggtgcctcg 


60 


cggcgtgcgc 


120 


cgacgcgctc 


180 


cctgcctgcc 


240 


acagacgcag 


300 


cgcagacgat 


360 


cttatcgccc 


420 


gtccaaatcc 


480 


agcggccgcg 


540 


cgatctcgcc 


600 


aacggctctc 


660 


ggcgaccatc 


720 


cctgaacgcg 


780 


agcgctccac 


840 


ggagaccaac 


900 


catgcagttt 


960 


cacgccacag 


1020 



ctcgccgccg 


actgggtgaa 


gtacgccgac 
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gtgacccacc acgacaacgt 


catgtattgg 


1080 


gaaattggca 


acgagattta 


cggcaacggt 


f actacaacG 


acaacaaata 

yLoav.yyy k.vj 


ggaggcggac 


1140 


gatcacgccg 


tggccggcca 


gccgcaaaaa 


aacaacccta 


atttaaaccc 


gcaggcgtac 


1200 


gcgcaaaacg 


ccctgcagtt 


catcaaggcg 


atacacacca 




catcaagatt 


1260 


ggggccgtgc 


tcacgatgcc 


gtacaactgg 


ccQtacaaca 


caaccatoaa 


cggcaacgac 


1320 


gactggaata 


ccgtcgtcct 


gaaggcgctc 


aonccctaca 




ggacgtgcac 


1380 


tggtaccccg 


agacgcccgg 


gcaggagacc 


aac qccqqcc 




cacagatcaa 


1440 


atccccgcca 


tggtggcgga 


gctcaagcgc 


y d d y Li^ou^a 


LaLy LL.yy 


atcgaacgcg 


1500 


aagaacatcc 


aaatctttgt 


gaccgagacc 


oaLwy v.y l.q i_ 


rataf aaf rf 

L,y LdvaaLLL. 


cggcgagcag 


1560 


tcgaccaacc 


tgcctgaagc 


gctcttcttg 


yi,yyaLya l. v_ 


L\_a.L,v_yyy u L. 


catccaggcc 


1620 


ggcgcggcca 


acgtcgactg 


gtgggatctg 




rrnaflaaraa 

v_L.yciyycivcici 


ctacacaagc 


1680 


ccgagcctct 


acggccagaa 


cctgtttggc 


y d L La L.y ^uu 


trttotf rtr 

L \_ L. L. y LLL. l_ V_ 


cggccagacc 


1740 


acgcaaaacg 


gttggcagga 


gccgcccgcc 


uaL,u.\.y v.Ly^_ 


ttrrorrrta 


caatggcttc 


1800 


cagctggtct 


cggatttcgc 


gcagcccggc 


y aLav.y u Ly L 


LLyy L LL.L.UL 


cacgtcgcag 


1860 


agcgccatcg 


acgtgcacgc 


cgtgcgcaag 


r r n a a *t~ n ci rn 

LLljClu l_y y l_y 


uLu L. V- v_ y V. v. 


catgctcgtc 


1920 


aatcgcagcc 


catccgccat 


ctacagcgcc 


C CT C\Zk 7k C C\ 
adLt LyaaLy 


t* n r* 1" r n n n "t* t* 
iy<_ LL_y y y l l 


cgggccgttt 


1980 


gtcgtgacac 


atgcgctcgc 


gtacggtgaa 


yUL LLuuyLL 


nrnf onrorr 


catgccggtt 


2040 


cttcccgtcc 


ccggcgcgcc 


catcaagctc 




y L_yy y a. lllja 


tctcaccctg 


2100 


cacccgctca 


ttccggcgcc 


acacgccgcc 




rrnatarort 


cacgctgtct 


2160 


tcgcccacgg 


tgacggccgg 


cggtgcggag 


QLI^k< L L_ L. L. L>y 




ggcggatcga 


2220 


ccggttcatc 


acgccacggt 


ggagctcgag 


L. Ly La LyaL L 


v_ya.L_y<*c*L,ycL 


tctcgtcgcc 


2280 


acccacaccg 


tctcggatgt 


cgatcttcag 


LL^yya l L.y y 


LLdLaay Lya 


gacgtggagt 


2340 


ttcaccgcac 


cggccgcgaa 


cggcaattac 


<^y i_y c Ly dyy 


l_ y l l L.y ty l l 


tgacccggtg 


2400 


acgggcgcga 


cgtacgacgc 


ggacacgcag 


yyL.yv.yy l ll 


LyaLLy nuclei 


ccagccgcct 


2460 


caggcgacct 


acggcgacat 


cgtgacgaaa 


fiararonf ra 
y aLci^y y l \~cl 


trarnntnaa 
L L-ci L. y y Lyaa 


cgggacgacg 


2520 


tacgacgttc 


cggcacctga 


tgcgggcggg 


cactatccgt 


cggggacgaa 


tatttcggtg 


2580 


gcacccgggg 


acacggtgac 


cgtgcagacg 


acgtttgtca 


acgtctcatc 


gacggacgcg 


2640 


ctgcagaacg 


ggctcatcga 


catggaagtg 


gacggatcga 


acggggccat 


cctgcagaaa 


2700 


tactggccga 


gcacgactct 


tttgcctggc 


caatcggaga 


cggtgacggc 


gacgtggcaa 


2760 


gtgcctgcga 


atgtggcggc 


cggaacgtac 


ccgctcaact 


tccaggcctt 


caacacgagc 


2820 


agctggacgg 


gaaactgtta 


cttcacaaac 


ggtggcgtgg 


tcaacttcgt 


gatcagc 


2877 



<210> 2 

<211> 816 

<212> DNA 

<213> Alicyclobacillus sp. 
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<220> 

<221> mi sc_feature 

<222> (1). .(816) 

<220> 

<221> misc_feature 

<222> (1) . . (96) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (97).. (816) 

<223> mat_peptide 

<400> 2 



atgaacggca 


cctcagtctg 


gaaagcgtca 


ggcatcgcag 


ccgcctcgtg 


cctgacagcc 


60 


gcggcacttc 


tcgcctggcc 


ccacgccaca 


tccacgttgg 


acgcgtcgcc 


cgccatcrtc 


120 


cacgcgccgc 


ggcacgcgct 


ctcgcccaac 


accagcccga 


aaccgaacag 


cgtccaggca 


180 


cagaactttg 


gttggtcggc 


gtcgaactgg 


tcgggatatg 


ccgtgaccgg 


cagcacgtac 


240 


aacgacatca 


caggcagttg 


gattgtgcct 


gcggtgagcc 


catccaagag 


aagcacgtac 


300 


tcttcgagct 


ggatcggcat 


cgacgggttc 


aacaacagcg 


atctcattca 


aaccggcacg 


360 


gagcaggact 


atgtcaacgg 


tcacgcgcag 


tacgacgcct 


ggtgggaaat 


cctccccgcc 


420 


cccgagacgg 


tcatctcgaa 


catgaccatc 


gccccgggcg 


accggatgag 


cgcgcacatc 


480 


cacaacaacg 


gcaacggaac 


ctggacgatt 


acgttgacgg 


aicgtgacccg 


caacgagacg 


540 


ttctccacca 


cgcagtcgta 


ctcgggccct 


ggctcgtcgg 


ccgagtggat 


ccaggaggcg 


600 


ccggagatcg 


gcggccggat 


cgccacgctc 


gccaactacg 


gcgagaccac 


gttcgatccc 


660 


ggcaccgtaa 


acggcggcaa 


cccaggtttt 


accctgtccg 


acgcgggcta 


catggtgcag 


720 


aacaacgcgg 


tcgtgtctgt 


gccgtccgca 


cccgactcgg 


ataccgacgg 


cttcaacgtg 


780 


gcctacggct 


ccaaccagcc 


gagcccaccg 


gcctcc 






816 



<210> 3 

<211> 945 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_feature 

<222> (1) . . (945) 

<223> CDS 

<220> 

<221> misc_feature 

<222> (1). .(75) 

<223> si g_pepti de 

<220> 

<221> mnsc_f eature 

<222> (76). .(945) 

<223> mat_peptide 

<400> 3 

atgagaagac gcatgtcagg ctttgcgacg ggccttggca tcgcggcggg gctcgccctc 60 
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agttccgccc 


tcgccgcgcc 


gttcttccac 


gccgggaacg 


cgtccgcggc 


gtcgacgatg 


120 


tcgatggcgc 


cgacgagcac 


catgggcgcc 


ctgcccgcgc 


ccgaaggcgt 


gccggacgca 


180 


ggcccgctgt 


cgatcacgcc 


cgaggtcatt 


cgccaacaac 


aggctgacgc 


tgtccgggtc 


240 


atggacgaag 


aaggcctgaa 


gccacagatc 


ctctccggcg 


acatcaagcg 


attcaccctc 


300 


accgcgagcc 


aggtgaactg 


gtatttgtac 


cccggcaaag 


cggtcgtcgc gtgcggctac 


360 


aacggccaag 


tgcctggccc 


ggtcctccgc 


gtgcgcgtgg 


gcgatcgcgt 


ccaaatcctc 


420 


ctgagaaacg 


agctgaacga 


gcccaccacg 


ctgcacatcc 


agggcctcga 


tctgccggcg 


480 


tcgcagttgg 


gaatcggaga 


cgtcaccgaa 


tcccccatcc 


ctccgggcgg 


cgaacgcctg 


540 


tacagcttca 


ccgtgacgcc 


acagatggtg 


ggcacccacc 


tgtacgagag 


cggcacggat 


600 


atggccagcg 


agatcgaccc 


aaggactgca 


cggggtgctg 


ctcgtcgatc 


cggcccgggg 


660 


atccctttat 


ccccaggcga 


aggtggacgc 


gctcttcgag 


atcgacgcgt 


ggatggtgga 


720 


cggatcgacc 


accgaaaacg 


cgtttggcct 


ggacggcaag 


ccgtatcccg 


acgcgcccga 


780 


actgacggtg 


ccgtacggca 


gccgcgtggt 


gctgcgcatc 


gtcaacgcga 


gcgggatgtg 


840 


ctaccacgcc 


atgcacctgc 


acgagacgac 


gttttggctg 


ctggcggaag 


acgggcaccc 


900 


cctcgccaag 


ccgcggccga 


tgaacgtgct 


cgccatcgcg 


ccagg 




945 



<210> 4 

<211> 1878 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_feature 

<222> CD . . (1878) 

<223> COS 

<220> 

<221> mi sc_feature 

<222> CD-- (66) 

<223> si g_peptide 

<220> 

<221> misc_feature 

<222> C67). .C567) 

<223> propeptid 

<220> 

<221> mi sc_f eature 

<222> (568). .C1878) 

<223> mat_peptide 

<400> 4 



atgggcttgt 


ggaaacggct 


ggcgctcggc 


gtgcctgcgg 


cacttagcat 


gctcgcggtt 


60 


ggggtgcctg 


tgatgagcgc 


ggacaccgtg 


gaggctgcgc 


cgcttgcgaa 


tccttcaacg 


120 


gaaaatgcgc 


aggatatggg 


gccggcgagt 


gggagccaga 


cggtgacggc 


atccatcatt 


180 


ttgcgtgrgc 


agaatccgac 


ggcgctgcag 


aactacattc 


aagagacgga 


gacaccgggc 


240 


agtccgctgt 


accataagtt 


cttgacgacg 


gcgcagttcg 
Page 4 


ctcagcagta 


cgcgccttcg 


300 



10406. 5T2 5 

gcggcgaccc ttcagcagat tgagcaggag cttcagggct atgggctcca ggtcgtgaat 360 

gtcgacgcgg atcacctgga catgcaggtt cagggcacag ttcagcagtt tgacaacgcg 420 

ttcaacaccg tgatcgacct gtttaaggca aacgggcaca tcttccgcgc gccgaagaag 480 

ccgccgcaga tcccggtggc gcttctcacc aacgtgctcg ccgtggtggg actcgatacg 540 

gcacaggcgg cgcagtcgct cacggtgaag acgccgaacg tcgcgggtgt gccttcgccc 600 

aaggtggtgc ttccgcaggg aggcagcacg gcgacgggca cgccagggag ctacacggtt 660 

ggggatacgg cgaatcgcta cgacatcaac ccactctatc agaagggtat cacgggcaag 720 

ggcgagacca tcggcattgt gacgctgtcg agctttaatc cgcaggatgc ctacacctac 780 

tggcagggca ttgggctgaa ggtggctcca aaccgcatcc agatggtgaa tgtggacggc 840 

ggtggccaga tggatgatgg atcggtcgag acgacgctgg acgtggaaca gtcgggcggt 900 

ttggcgccgg acgccaacgt cgtggtgtac gacgcgccga atacggatca gggcttcatc 960 

gatgcgttct accaggcggt ctcggacaac caggcggatt ccctctcggt gagctgggga 1020 

cagcctgaaa tcgattacct gccgcagatg aaccaaggcc agtcgtatgt ggatgagctc 1080 

ctcgccttca cccaggcgtt catggaggcg gcggctcagg gcatttccat gtacgcggcc 1140 

gcgggggatt caggcgccta cgacacggcg cgcgacttcc cgccctccga tggcttcacc 1200 

acgccgctca gcgtggactt tcccgcctcc gacccgtaca tcacggctgc gggaggcacg 1260 

acggtaccgt tcaccgcaaa gttctcgctc ggcacggtca acatcacgca ggagcagccc 1320 

tggtcgtggc aataccttca aaacctcggc taccaagggc tcttctccgt gggcacaggc 1380 

ggtggcgtga gcgtcatctt cccgcgcccg tggtatcagc tcggcgtggg cggcatgcaa 1440 

aatagcgcgg ccaatcaggc cttcaccgac tcgcagggcg ttttgtacgg atcgccettc 1500 

acgtacaacc tgccgagcaa ttacgcgggc cggaatctgc cggacatctc catggatgct 1560 

gatccggaga cgggctatct ggtctactgg agcgcgggcg gtggctggat tgcgggctac 1620 

ggcgggacga gcttcgtggc gccgcagttg aacggtatca cggcgctcat tgatcaggag 1680 

gtccatgggc gagtgggctt cctcaatccg ctgctgtaca ccctgttgac gcaaggggtc 1740 

caaggtgggg cgcagccgtt ccacgacatc acgacgggga acaactggta ttggaatgcg 1800 

gtgcctggtt acgatccggc ctcgggcgtg ggcacgccgg acgtcgcgaa cttggcgcag 1860 

gacatcgcat cgctgaga 1878 

<210> 5 
<211> 1599 
<212> DNA 

<213> Alicyclobacillus sp. 

<220> 

<221> mi sc_feature 
<222> (1). .(1599) 
<223> CDS 

<220> 
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<221> mi sc_feature 

<222> (1)..(72) 

<223> sig_peptide 

<220> 

<221> mi sc__feature 

<222> (73).. (1599) 

<223> mat_peptide 

<400> 5 



atgcgagcgc 


tcgcacattt 


ggccattggc 


gccatcgcgt 


ccggcgtttt 


cgctgcacct 


60 


gtcgcttttg 


cctcgccggt 


tcaggaacgc 


gtggtggtgg 


cctcgcccga 


tccgcggacg 


120 


cgtagcgttc 


acgcggatgg 


cgaaatttca 


ccgtcgcagc 


cgatgcactt 


ggtcattacg 


180 


cttcgcctgc 


gccacgaggc 


gcagctcgag 


cagctgattc 


gagacctgta 


cacgccggga 


240 


tcgcccgatg 


caggtcactt 


cttgacgccc 


gcggcgttta 


acgcggcgta 


tgcaccgacg 


300 


gctgaggacg 


tgcaggccgt 


ggtccagggg 


ctgcgcgcat 


acggcctccg 


cgttgagccg 


360 


acggtaaatc 


ccatggtgct 


gaccgtgagt 


ggacgggccc 


gcgacgtcga 


gcgagcgttt 


420 


ggcgtgcatg 


agctccaatt 


tgggcgcgga 


gctggcgcat 


ggtacgcccc 


ggatggtgcg 


480 


gccacgctgc 


ctgcaccgct 


cgccgcgcgc 


gtgtcggccg 


tggtaggcct 


gacgagcgac 


540 


gcgatggagc 


gccacctcgt 


cctggcgcac 


gtcgcgccgg 


caggaggtgg 


ctacacgccc 


600 


gcgcaaattc 


agcgcgccta 


cgactatacg 


ccgctctaca 


gccaatacat 


ggggcgcgga 


660 


caggtcattg 


cggtggtgac 


ttccggctcg 


gtgctccgct 


ccgacctgct 


cgcgttcgat 


720 


cgcgccttcg 


ggcrtccgaa 


tccggtggtg 


cgccagcggg 


tgatcgacgg 


atcgtccacg 


780 


tctcccgacg 


acgagaccac 


cctcgactgc 


gagtgggcgc 


atgccatcgc 


gccgacggca 


840 


tcgctcgccg 


tgtacgaggc 


cgctcaaccg 


gacgcgcagt 


cgttcatcga 


tgcgtttgcc 


900 


caggtggcgg 


cggacgatgg 


cgcgcatgtg 


gtcacgacga 


gttggggagc 


gcccgagtcg 


960 


gagaccgacg 


cggcgaccat 


gcaggcggag 


caccagatct 


tcatgcagat 


ggccgcccag 


1020 


gggcagagcg 


tgttcgccgc 


ggcgggcgac 


agcggatcgt 


cggacggaac 


aagcgggacg 


1080 


gacgtcgact 


atccgtcgtc 


ggatccgtat 


gtcaccgcgt 


gtggcgggac 


gaggctcgtt 


1140 


cttggtgcgg 


gtgcaaagcg 


gctgcaggag 


acggcgtggg 


ccgacacggg 


cggcggcgcg 


1200 


agctcggtgt 


acggagagcc 


gtggtggcaa 


tatggcccgg 


gcgtgccgca 


gacgggctat 


1260 


cggcagacgt 


gcgacgtcgc 


cctgaacgcc 


gatccggcca 


cgggctacga 


tttcatctat 


1320 


gagggtcagt 


gggaggtggc 


cggggggacg 


agctttgtcg 


cgccgatgat 


ggccgcgacg 


1380 


tttgcgctca 


tcgaccaagc 


gcgtgccctc 


gaaggtaagc 


cacccgttgg 


gctcgcagac 


1440 


gtcggcatct 


atgcgatggc 


gcgcaacgcg 


tcctacgcgc 


cgtacgcatt 


ccacgacatc 


1500 


acggccggat 


cgaacggcgc 


gtacagcgcg 


ggcccgggat 


gggatcatcc 


aaccggcttt 


1560 


ggttccatcg 


acgcgtacta 


ctttttgcac 


gggctcgac 






1599 



<210> 6 
<211> 1233 
<212> DNA 
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<213> Alicyclobacillus sp. 



10406. 5T2 5 



<220> 

<221> misc^feature 

<222> (1). .(1233) 

<223> CDS 

<220> 

<221> miscfeature 

<222> (1)..(123) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (124).. (1233) 

<223> mat_peptide 



<400> 6 
atgcggcgtc 


gacgttggga 


ttacgaggac 


tggccgagtg 


agaacaggcg 


tgtcggcgtg 


60 


tggctcgcga 


gcgggaccgc 


gctgcttgcc 


atctgctaca 


tcctcggcat 


ctggacgggt 


120 


gcggcgctca 


cgcgcggtca 


ttcccagacg 


accgtggaat 


acgttcctcc 


ccagacgggc 


180 


aacaccgcga 


gcacgtccgg 


atcgctcacg 


ccgatcccgg 


gcgtcgagga 


cacgaccata 


240 


gtgacgcaga 


tttataaccg 


agtgaaaaat 


agcatcttta 


ccattacggc 


cgtctccgga 


300 


ggcaagccga 


cgtcgagcga 


cgcagaagaa 


gatatcggca 


cggggttcct 


gatcgatcac 


360 


aacggcgatc 


tcttgaccaa 


cgcgcatgtc 


gtcggatcgg 


ccacaacggt 


ccaggtgtcc 


420 


ggggacaacc 


gccaattcgt 


cggccgcgtg 


attgacgccg 


accagctgga 


cgatctcgcc 


480 


atcgttcgca 


tcccggcgcc 


caaatcgctg 


gaaccgctgc 


cgttgggatc 


ggtgaagtcg 


540 


cttcagccgg 


gcagcctggt 


catcgccatc 


ggcaacccgt 


ttgagctgac 


ctcgagcgtc 


600 


agctcgggca 


tcgtgagcgg 


actcaaccgg 


tcgatgtccg 


agtcgaacgg 


gcacgtgatg 


660 


aacggcatga 


tccagacgga 


cgcgccgctc 


aaccctggaa 


attcgggagg 


cccgctgctc 


720 


aacgcggcag 


gacaggtcgt 


cggcatcaac 


acgctgatcg 


aaagccctat 


cgaggggtcc 


780 


atcggcattg 


gctttgccat 


tcctatcgac 


cggtttatcc 


agctcgagcc 


agaattgctc 


840 


gccggcaaac 


ccgtcgcgca 


cgcctggctc 


ggcatcgagg 


gaatggacat 


cgacaacctg 


900 


atgcgtcaag 


cgctgcactt 


gcctgtggcc 


tcgggcgtct 


atgtgaccga 


agtgaccccg 


960 


ggcggccccg 


ccgcgaaagc 


ggggctgcgc 


ggagattcga 


acgcggccaa 


gttgaacagt 


1020 


ctaagccagt 


cggccaatcc 


gtacgcgctg 


ctcaagggga 


acggggacat 


catcgtcggg 


1080 


attgacggca 


agcaggtctc 


cagcatcgaa 


cagttgacgc 


aggatatcaa 


ccaagatcaa 


1140 


ccgggtcaga 


cggtggtgct 


caccgtgttg 


cgcgcaggca 


aaaccctgca 


cgtgcgcgtc 


1200 


acgctcggga 


cctggccatc 


cagccaaaat 


ccg 






1233 



<210> 7 

<211> 633 

<212> DNA 

<213> Alicyclobacillus sp. 
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<220> 

<221> mi s cofeature 

<222> (1)..(633) 

<223> CDS 

<220> 

<221> misc_feature 

<222> (1)..(90) 

<223> sig_peptide 

<220> 

<221> mi sc_f eature 

<222> (91). .(633) 

<223> mat_peptide 

<400> 7 



atgcgcaggt 


cttggagcgt 


gctcatggcc 


gtttgcatgt 


cttggttggc 


ggtggggtgt 


60 


ggcacgcctg 


caaactcgtt 


gtcacaagcg 


accgctgcgt 


ctggaaggca 


cgcgccgcac 


120 


cccctcgtgt 


ttcagaacct 


cacaggtgcc 


atgaacgagg 


ggcaggatcc 


ccggtgggac 


180 


ccgaaagcgg 


ctcccacggg 


tgtctacgac 


gacgtgaccg 


tggtcacagc 


gagtggccga 


240 


caggaggtgc 


tctccgttcg 


ggatgcgccg 


ctcctgttcg 


cagegtactg 


gtgccctcac 


300 


tgccagcgca 


cactgcagct 


tctcacgtcg 


attgaatcac 


gectgaagea 


aaageccatt 


360 


cttgtgaacg 


tcggctatcc 


tccgggcacg 


acactgcaga 


ccgcggcgcg 


catcgcgcgc 


420 


gaggagtctc 


aagttcttca 


cttggcgccg 


ttccaagagg 


tctttatctt 


gaatcctgat 


480 


gcaggggatc 


gatacgcccc 


gctagggtac 


ccaacactcg 


ctttttatcg 


cgccgggcga 


540 


gattggacgc 


tgtacggtga 


acatcgagcg 


tctatttggg 


aaaaggcect 


gtccgaatcg 


600 


acatcaaaag 


cgtacaatgg 


cagcgaggaa 


tea 






633 



<210> 8 

<211> 798 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> mi sc_feature 

<222> (1). .(798) 

<223> CDS 

<220> 

<221> mi sc_feature 

<222> (1)..(87) 

<223> sig_peptide 

<220> 

<221> mi sc__f eature 

<222> (88) . . (798) 

<223> mat_peptide 

<400> 8 

atggatgaga tgaacattcg atcttggtgt gtegctgett gtacegtage cttgacaagc 60 

gccgtgggcg cgacgaccgc gttcgcgcag acggtgaccg tacaacccgg acaatcgctc 120 

tggaccatcg cacgcgcaca cgggatgccc gttcagttgg tggcgtccgc caatccgcag 180 

tacaatccgc tgaatctccc tgttggtgcg accgtcacac ttcccagtct caaggaegtg 240 
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gctgtgcagc 


cgggcgactc 


cctgtttctg atcggcaggc aatatggcgt gtcgctcgcc 


300 


gagatgttgg 


ccgcaaaccc 


gaacgtggat ccattgaatc tgcaagtggg 


ttcaagtgtg 


360 


cgtgttcccc 


ttgcatcatc 


ttcgaccaag agctccacag tttctgccca 


tgttgccgca 


420 


tccacgcccg 


aaaactccaa 


caacctgtac tggttggagc gcgtcattca cgcggaggcc 


480 


ggcggagaat cgctgcaggc 


acaaatcgcc gtggccgacg tcattctcca 


tcgcatggcc 


540 


gcgggtggat 


acgggagcac 


ggtgcaacaa gtggtcttcc aagtgagcga 


cgggcactac 


600 


caattcgaga 


gtgtcgcaaa 


cggttcgatt tacggtcagc cagacgcaca 


aaacgtgcag 


660 


gctgctctcg 


acgcgttgaa 


cggagacgat gtcgtcccag gcgcgttggt 


cttctacaac 


720 


cccgcgcaga 


cgccttccgg 


aagttgggtt tggcaacaac ctgtggtcgc 


tcatatcggt 


780 


catctcgtgt 


ttgcgaag 






798 



<210> 9 

<211> 2304 

<212> DNA 

<213> Alicyclobacillus sp. 
<220> 

<221> misc_feature 

<222> (1). .(2304) 

<223> CDS 

<220> 

<221> mi sc_f eature 

<222> (1)..(78) 

<223> sig_peptide 

<220> 

<221> mi sc_f eature 

<222> (79).. (2304) 

<223> mat_peptide 

<400> 9 



gtgaagacgc 


atcgcctgct 


cgcggtcgcg 


gcactgcctg 


caacagtgct 


gttgacaacg 


60 


ccggcgcccg 


cgctggctga 


gacctcgagc 


tcgcagagcg 


cttcggcgcc 


gtcgctgaac 


120 


gtgccggtcg 


ctgccctgac 


cctcgcgggt 


gttcaatcgt 


atcccatgct 


gagctacgga 


180 


tccacgggcg 


tgtacgtgga 


aattttgcag 


aacgccctga 


atgccctggg 


ctatgacgtg 


240 


ggacaagcca 


gcgggctgtt 


cgacgccacc 


acgcaggccg 


aagtgaaggc 


ttttcagcag 


300 


gcgatgggcc 


tgcagacgga 


cggcattgtg 


ggtcccctga 


cctggggggc 


tttggcgaag 


360 


gcggtggccg 


attatcgcca 


ggtgatgacc 


gtactctcca 


gtcgcagctc 


gctggttcag 


420 


caagtcgaat 


ggaagcgcat 


cgtatggaac 


ggcaggttga 


tttcgaagcc 


catcggcttc 


480 


acgtaccagg 


ggacagcgta 


catgcccatt 


tggtacgtca 


tgcaggcgct 


tagcaaggcg 


540 


ggcattgcga 


gcacgtggca 


gggaggggtt 


tggacgctca 


cgccgcccgg 


aggtcagacc 


600 


gtgaattacg 


gaaagatctc 


gtacgggccg 


ggcagtgcgg 


ccatcgccat 


cggccagacc 


660 


gtggtcgcca 


atgtgcccgc 


ggtggtgtac 


cctgatccgg 


catccggaaa 


gctcacgacc 


720 
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♦* "t* f~ —1 #>i r~- r~m 

l tea xgccxg 


xxxggxacgx 


caxgaacgcg 


xxgeagegge 


xgggcaxegg 


xxcyacgxgg 


f OU 


cagggaaccg 


agtgggacat 


gaagccagcx 


cccgtcgtga 


tegagaeggg 


cgaxccgxcg 




^ *a #** ^ ^ ^* "a ^ ^ 

ddCadCdCCa 


ccggg icaga 


xcccgcgaac 


ageaegggea 


acggcaccgg 


yaacxcgacg 


Z7UU 


ggcaacgcca 


cgggcgccgx 


r~ ^ rj rt r\ r* r\ n r* 

gccaggcggc 


aaxaccgxga 


cgaacgxcac 


cacgggcxcg 




^ ^ ^ «^ r» +■ ^* *a 

LCCddCg ICd 


ccggcaacxc 


gaegggcaac 


agxxxgggga 


acxcgacggg 


caacagexxg 




ggcaacagca 


cgxcgaacgc 


gaegggcaax 


gccaccggca 


acaccaccgg 


gaaxgegace 


XUQU 


ggcaaxxeca 


egggcacgag 


cagegggxeg 


xxcacgaaxg 


xcgaccxgcg 


cxaxccggcg 




ccgxccaaca 


xcaaxgegea 


gagcaxcaac 


^ 4* 4— -4- 4— 0+ 

cagxxxcxg c 


tgcagaacag 


cxcgccgcxc 




aaxgggc rgg 


geaaxxegxx 


caxggacgcc 


cagaacctgt 


acagegxega 


cgccaacxac 




cxxgxcxcgc 


acgccatcct 


cgagagtgcg 


tgggggcaaa 


gecaaattge 


ccttcagaag 




aacaaxexgx 


ttggcxacgg 


cgexxacgat 


tcgaaccccg 


gaeaggatge 


gggegtatte 


X30VJ 


ccgagcgacg 


actacgccat 


ccgattcgag 


gcgtggaccg 


tgcgcatgaa 


ctacctcacg 


1 A Af\ 


ccgggcgcga 


gcttgtacgt 


gacgccgacg 


ct cageggaa 


tgaacgtgaa 


ctacgccaca 


-LjUU 


gccaagacct 


gggcaagegg 


cattgcggcc 


ateatgaege 


agtttgcgag 


ctccgtcgga 


i cert 


tcgaacg tga 


atgcgtacgt 


gcagtacacg 


ccgtccaaca 


atccgcccgc 


tccgagatcg 


T CTfl 

JLb^U 


acageggaac 


cggtgtacta 


ca tgaaegge 


gcgcaagggg 


taaegcagea 


ggatcegtat 


t con 


xacccgaaxg 


gcggcgttcc 


gt acxacccg 


accatcgcgc 


agggtgagaa 


tcagcagttc 


t 7 /i r\ 
1/40 


xxrggccagc 


taagtgtcgg 


cagcxTcggx 


caacccgtgg 


tggaggttca 


gcagttcctg 


loUU 


aaccggacca 


teaacgeggg 


getgacegtg 


gaegggcagt 


ttggcccgct 


gacgcaggcc 


T OCA 


geggtcgaga 


agttccagtc 


gcaggtcatg 


cacatgtcga 


acccgaacgg 


catttggacg 




xxcagcaxgx 


gggtccagta 


catccagcct 


tctcagtcga 


acgccaatct 


catcccggct 


lyso 


gggaccaccg 


tgaaaattga 


ccaggtcgcc 


gagggcatgg 


cgggcccgta 


cgtcgtgcct 


2040 


tggtaccacg 


tggtgggcta 


tggctgggtc 


gaetegcagt 


atatcaagtt 


gaecaaegtg 


2100 


tatcgegtea 


ttgtgcagaa 


cccggccgga 


acggccacca 


ccattcccgt 


ctaccaggtg 


2160 


ggcaacctgt 


etteggtatt 


gctcaatctg 


cacageggag 


actgggtggt 


tgccaactca 


2220 


gcgcagccct 


egggeggegt 


gtacaccatt 


cagattgegg 


ctcaggatcc 


accgtgtcga 


2280 


acggctacgc 


cgccgggacg 


ctct 








2304 



<210> 10 

<211> 1791 

<212> DNA 

<213> Al icyclobacillus sp. 



<220> 

<221> mi sc_f eature 

<222> (1). .C1791) 

<223> CDS 

<220> 

<221> mi sc_f eature 
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<222> CD ■ - (147) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (148) . . (1791) 

<223> mat_peptide 



<400> 10 
atgatggccc 


acgatagatt 


ggacaggcga 


gtgaatgaga 


ggaggcaagc 


catgcgacgc 


60 


gcggcaaaat 


gggcaatcgc 


ccttggcacg 


acggcagtgg 


tggctggtgt 


cagcagcgtg 


120 


ttcgcacttc 


gcagtgtgcg 


agaagcaaac 


ctgaatccca 


acgcccctct 


cgcgaacgtg 


180 


cccgggcctc 


agggcgccta 


tacgcccatc 


agcgcgcttc 


agcccgtcgt 


tccgaaaaac 


240 


gcgcggatcg 


accactacac 


gctgacggcg 


gaatcccgca 


cactgaccgt 


cggcggccat 


300 


gccctgcaag 


ccatgacgtt 


caacggcacc 


gcgccagggc 


cgttgcttgt 


ggcccatcaa 


360 


ggcgacgtcg 


tgaaggtcac 


ggtgcacaac 


cgcctctccg 


tccctctgac 


cattcactgg 


420 


cacggcatcg 


cggtgcccgg 


cgcggaagac 


ggcgtccctg 


gtgtcacgca 


aaacccaatt 


480 


ccgcctggcg 


ggagctacac 


gtacgagttt 


caggttaacc 


agcccggaac 


gtactggtac 


540 


cactcgcacg 


aggcgagctt 


tgaagaggtg 


ggcctcgggt 


tgtacggcgc 


cttcgtcgtt 


600 


ctgcccaaac 


gggcggtcca 


tccggccgat 


cgcgactaca 


cgctcgtcct 


gcacgagtgg 


660 


ccgaccgcat 


ccaccgcgca 


gacgatgatg 


gcgaacctca 


aggctgggaa 


cttgggattc 


720 


tcagcgaaag 


gcgaatccgc 


aggcatgggc 


ggcatgggca 


tgcaacaaaa 


cggggacatg 


780 


aacggcatgg 


gcatgatggg 


cgcggcggac 


ggcacgggtc 


agggaggaaa 


tagcgcgagc 


840 


gacatcgcgc 


acgtgttgcc 


tggccccccg 


cttcaactga 


acggtttttc 


gccgaccgca 


900 


aacgattggg 


ctgcgcttga 


cgaaatggcg 


ggcatgtatg 


acgccttcac 


ggtgaatcag 


960 


aacgcgagcg 


gtacaacgct 


cttgccagcc 


aagccgggac 


agctcgttcg 


gcttcgcatc 


1020 


gtgaacagcg 


gcaacatgac 


acacctgttc 


acgctggtcg 


gcgcaccgtt 


tcgcgtcgtg 


1080 


gcgctcgacg 


gccacgacat 


tgccaacccc 


ggttggatcc 


gcggcgtctt 


gcttcccgtc 


1140 


ggcgctgcag 


agcgatacga 


catcgaattt 


cgcgtgccaa 


agtccggggc 


cgcattcctt 


1200 


gtgtgcgccg 


atcccgacac 


gactgcacag 


cgcgagcttc 


gcgccgecat 


cggtctgccc 


1260 


gacgcctggt 


cacaattcaa 


ggagacggat 


gcagcgagcc 


ttgaacgagc 


gccgtggttc 


1320 


gactttacac 


actatggcag 


cggcaggctg 


cccggcgaag 


ccgtgttccg 


cctgcatcag 


1380 


gcgtatcagg 


tacgctacaa 


catgaagctc 


accgtcggca 


tgtcgatgaa 


cggcatggtg 


1440 


tacgccatca 


acggcaaggt 


ctttccgaac 


atcccgccca 


tcgtcgtgcg 


aaagggcgac 


1500 


gccgtcctgg 


tccacatcgt 


gaacgacagc 


ccctacattc 


acccgatgca 


tctgcacgga 


1560 


cacgactttc 


aagtgctgac 


gcgcgatggg 


aaacctgtct 


ccggaagccc 


catcttcctg 


1620 


gacaccttgg 


acgtgttccc 


cggcgagagc 


tacgacatcg 


cgtttcgcgc 


cgacaacccg 


1680 


ggtttatgga 


tgtttcactg 


tcacgatctc 


gaacacgccg 


cggccggtat 


ggacgtcatg 


1740 


gtccagtacg 


cgggcatccg 


cgatccctac 


ccgatgagcg 
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<210> 11 

<211> 735 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_feature 

<222> (1)..(735) 

<223> CDS 

<220> 

<221> mi sc_feature 

<222> (1)..(87) 

<223> sig_peptide 

<220> 

<221> mi sc_feature 

<222> (88). . (735) 

<223> mat_peptide 



<400> 11 
atgaaacgtc 


ggaccttgct 


tgcgggcatc 


acgctggcgg 


cgctcgtcgc 


ggtggcgggc 


60 


tgtggcacgc 


cggccggtaa 


caccgcctcg 


ccggacaaca 


cagcgaactt 


gtcgaacacg 


120 


aacgcgccgg 


acacgctgtc 


caatgaaacc 


ggccagacgc 


tcgatacggc 


caacccgccg 


180 


tacctgcaca 


cgtcgaccga 


gcagtggaag 


agcatgccga 


agatgttcat 


caacccgaac 


240 


aagacctatg 


acgccattgt 


ccacaccaat 


tacgggacgt 


tcaccatcca 


gctgttcgcc 


300 


aaagacgcgc 


ccatcacggt 


gaacaacttc 


gtgttcctgg 


cagagcacaa 


cttctaccac 


360 


gattgcacgt 


tcttccgcat 


cgtgaagaac 


ttcgtgattc 


aaacgggcga 


tcctcgcaac 


420 


gacggtaccg 


gcggcccggg 


ctacaccatc 


ccagatgaac 


tcagccatca 


ggtgccattc 


480 


acgaagggca 


ttgtcgcgat 


ggccaacacg 


ggccagccgc 


acacgggcgg 


aagccagttt 


540 


ttcatctgca 


cggccaatga 


cacgcaggtc 


ttccagccgc 


ccaacaatcg 


ctatacggaa 


600 


ttcggccgcg 


tgatctccgg 


aatggacgtg 


atcgacaaga 


ttgccgccat 


cccggtgacc 


660 


gaaaacccca 


tgacgcagga 


agacagctat 


cctctgaaga 


ctgcgtacat 


cgagtcgatt 


720 


caaattcaag 


aatcg 










735 



<210> 12 

<211> 1824 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> mi sc_feature 

<222> (1) . . (1824) 

<223> CDS 

<220> 

<221> mi sc_feature 

<222> (1) . . (81) 

<223> sig_peptide 

<220> 
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10406 . ST25 

<221> misc_feature 
<222> (82). . (1824) 
<223> mat_peptide 

<400> 12 



gtgaagaagg 


gaaagagatg 


gtccgccgcg 


ctcgcgacgt 


ccgtggccct 


gtttgccacc 


60 


ctgtcgcccc 


aagcgctcgc 


cagcgacacc 


gtggttccgc 


aagtgaacac 


gctcacgccc 


120 


attcatcacc 


tcgtcgtcat 


cttcgacgag 


aacgtctcct 


ttgatcacta 


tttcgccacc 


180 


tatccgaacg 


ccgccaatcc 


agccggcgag 


ccgccctttt 


acgccgcgcc 


gggcaccccg 


240 


agcgtcaatg 


gcctgtccgg 


aagccttctc 


acgcacaatc 


ccaacggcgt 


gaatccgcag 


300 


cgcctcgacc 


gttcccaagc 


cgtgacgccg 


gacatgaacc 


acaactacac 


gccggagcag 


360 


caggccgtgg 


acgggggccg 


catggataac 


tttatcaata 


cggtcggccg 


cggaaatccc 


420 


atcgatctcg 


actactacga 


cggaaacacg 


gtcaccgcgc 


tctggtatta 


cgcgcaacac 


480 


ttcgccttga 


acgacaacgc 


gtactgcacg 


cagtacggcc 


cgtctacgcc 


tggcgccatc 


540 


aacctgattt 


cgggcgacac 


cgcgggagcg 


acggtttatt 


cttcaagtga 


gaccagcggc 


600 


gccgcacaag 


tcgtgccacc 


cggcagcaaa 


aactttccga 


atgccgtgac 


gccaaacggc 


660 


gtcgacatcg 


gcgacatcga 


tccctactac 


gacagcgcct 


ccaaaggcat 


gaccatggcg 


720 


atggccggca 


aaaacatcgg 


cgacctgtta 


aacgcgaagg 


gggtcacctg 


gggctggttc 


780 


cagggcggct 


ttgcaaatcc 


gaacgccaag 


gacaacaata 


tcgccggcac 


agatgaaacc 


840 


accgattaca 


gcgcacacca 


tgagccgttc 


cagtattatg 


cgtctacggc 


aaatccgaat 


900 


catctgccgc 


ctacgagcgt 


ggcgatgatc 


gggcgcacgg 


atcaggcaaa 


ccaccagtac 


960 


gacatcacga 


atttcttcca 


agcattgcaa 


aacggaaaca 


tgcccgccgt 


gagtttcctg 


1020 


aaagctcccg 


aatacgaaga 


cggtcacgcc 


ggctattccg 


atcccctcga 


cgaacagcgc 


1080 


tggctggtcc 


agaccatcaa 


tcaaatcgag 


gcgtcgcccg 


attggtcctc 


caccgccatc 


1140 


atcatcacct 


atgacgactc 


ggatggttgg 


tacgatcacg 


tcatgcctcc 


gctcgtgaac 


1200 


ggatcgagcg 


acaaggccgt 


ggacgtgctc 


ggtggcacgc 


cggttctgca 


aaacgggacc 


1260 


gacagggcgg 


gctatggacc 


gcgggtgccg 


ttcctcgtca 


tctcgcccta 


cgccaaacac 


1320 


aattttgtcg 


ataacacgct 


catcgaccag 


acttccgttc 


tgcggttcat 


cgaggagaac 


1380 


tggggcctcg 


gctcgttggg 


cccagcgtcg 


tacgactcgc 


tcgccggatc 


gatcatgaac 


1440 


atgtttgact 


ggaacacgca 


gaacccgcct 


gtgtttctcg 


atccgacgac 


cggtgaaccc 


1500 


gtgtccccag 


atatgcagcc 


ggaggtcatt 


cgcggcacca 


cgtatctcag 


cctgaatcac 


1560 


tacgctcaaa 


acctcgatgt 


cgtgctgcaa 


acctctcggg 


ggatggcgcg 


gttctcctac 


1620 


gaggggcacg 


aggtcgagat 


cgacgagcgt 


tccgggcttg 


tccgggtcga 


tggcgaagcg 


1680 


gtccatctca 


aggcgcctct 


tgtgcgggtg 


gacggcgtat 


ggatggtgcc 


cgtagaggaa 


1740 


atggattcgc 


tcattggggc 


cacgctgcac 


acctacaccg 


acggtcatct 


cacctactat 


1800 


ctcttttctc 


cgcaagacgc 


ccat 








1824 
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<210> 13 
<211> 750 
<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_feature 

<222> CD . . (750) 

<223> CDS 

<220> 

<221> misc_feature 

<222> CD-.C75) 

<22 3> si g_pepti de 

<220> 

<221> mi sc_f eature 

<222> (76) . . (750) 

<223> mat_peptide 



<400> 13 
atgctgagct 


tgtggaagcg 


aatccgaacg 


ggaacactct 


cacttctggc 


tgcatgcgcg 


60 


tgcgcgctgt 


cggcgatggg 


cgctggggca 


ggatgggtgc 


atgcggctga 


gtcccaagcg 


120 


caagccccaa 


gggccattta 


caaggtggac 


acgaaggaaa 


aggtggtcgc 


tctcacgttc 


180 


gacatctcat 


gggggcaccg 


cacgcccgaa 


ccggttctcg 


agacactcaa 


gaagtgcggc 


240 


gtgaccaagg 


cgacgttttt 


cctgagcggt 


ccttggacca 


tgcaccacgc 


ggacatcgca 


300 


aagaaaatca 


aggcgatggg 


ctacgaaatt 


ggcagccatg 


ggtacctgca 


caaggactat 


360 


tccaattacc 


cggactcttg 


gattcgagaa 


caggcgatgc 


tcgcagacaa 


ggccattcaa 


420 


caggtcactg 


gggtcaagcc 


gaagctgttc 


aggacgccaa 


atggcgactt 


gaatccgcgc 


480 


gtcatccgct 


gcctgacgag 


catgggctac 


acggtggtcc 


aatggaacac 


cgattcgctt 


540 


gactggaaaa 


acccaggcgt 


cgacgcgatc 


gtcaaccgcg 


tcacgaagcg 


cgtggtgcct 


600 


ggcgatatca 


tcctgatgca 


cgcgagcgac 


tcgtccaaac 


agattgtgga 


ggccctgccg 


660 


cgcatcattg 


aatcgcttcg 


gcagcagggc 


taccggttcg 


tcaccgtctc 


cgagctgttg 


720 


gcgggcgcca 


gcgttcaatc 


caaggtccag 








750 



<210> 14 

<211> 972 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> mi sc_f eature 

<222> (1) . . (972) 

<223> CDS 

<220> 

<221> mi sc_f eature 

<222> (1). .(63) 

<223> sig_peptide 

<220> 

<221> mi sc_feature 

<222> (64) . . (972) 
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<223> mat_peptide 












<400> 14 
atgcggaaga 


cggctgcagg 


eg eg xg cgee 


ctggcgctga 


tgggggtctt 


gggcggttgg 


ou 


gcgggcgcgg 


ccggcacggc 


gg igaacgcg 


cacgcgccgg 


cggcgtcggc 


gecaag xg 




tcggcacatg 


tgtgggaaga 


ag Lcagccgc 


acgtggggaa 


cgcttcccgt 


cgaxgcccgc 


i an 

JLov 


cacgacggcg 


tgtggcacaa 


catccccggt 


ttgtcaggct 


ttgegctega 


t r— r* n i*~ ri n f~ #"i 

cacggcggcg 


Z*fU 


agcgagcgcg 


agaccgcgcg 


gcgccatgac 


ggcgcgctcc acctggtatg 


gcgaaccc ti 


3UVJ 


ccgccgaagc 


gaaqactcqq 


agacctttcg 


cccgacgtga 


tttaccgegg 


ccccgcgcag 


joU 


gagaagtcqq 


tqqcqctqat 


ggtgaatgtg 


tcctggggcg 


atgcgtacgt 


geccaggarg 




cttgaggtgc 


tqcqcaqcqc 


gcacgtgaag gccacgtttt tcgtggacgg 


cgcgtttgcg 


4oU 


aagaagttcc 


ccgatctcgt 


ccgcgcgatg 


gcgcgagacg 


ggcacgcggt 


cgagtcccac 


C /t n 
}4U 


ggctttggac 


acccagactt 


tcgccggctg 


agegacgega 


agctcgccgc 


ccagcttgac 


bUU 


gagacgaatc 


gagtgctcgc 


cggcatcacg 


ggcaaggttc 


cacggctcat 


cgcgcctccg 


bbU 


gccggatcgt 


atgatgcgcg 


cctggctccg 


ctggcgcatt 


cgcggcgcat 


gtacgccatc 


720 


ctgtggaccg 


cggataccgt 


ggactggaaa 


aacccgcctg 


eggatgtcat 


cgtccaacgc 


780 


gttcagcgcg 


gtgcggaacc 


cggcgcgttg 


atectgatge atcccacggc 


gcccacggcg 


840 


gaggccctgc 


ctgatgtgat 


ccgctggctc 


gaggggcacg 


gttategget 


gaaaacggtg 


900 


gaggacgtga 


tcgacgaacg 


cccagcggtc 


acccctccga 


cgacgctggc 


gaacgagacg 


960 


ttccacagcg 


eg 










972 



<210> 15 

<211> 642 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> mis cofeature 

<222> CD . . (642) 

<223> CDS 

<220> 

<221> misc_featiire 

<222> (1) . . (87) 

<223> sig_peptide 

<220> 

<221> mi sc_f eatu re 

<222> (88).. (642) 

<223> mat_peptide 

<400> 15 

atgatgcgtt ggaattggaa ggttgctgtg ggatcgttgg cgttggccgc actgggegea 60 

ggggeggegg tgtcgccggt gtttgcggcg gcgaagtcgt cgaaggccgc gcagtcccac 120 

geagaggega gcgcggcagt cgtgatggct gggaagctgt aeggcaacat tccgaacgtc 180 

accattcgcg gcgtggaagc tgggaaggcg ccgtgggtcg tggaeggate gtaccagctg 240 

page 15 



10406 . ST25 



aagagcaacc 


tgttcacggc 


gagtgggaag 


tggctcatca 


ttccgaagca 


gggctatatg 


300 


gagaacggtc 


agccggttcc 


ggccaaaatt 


ggcggcacga 


cgaacaacat 


tccggccgtc 


360 


ggggccgaaa 


tcacgtttgc 


aaacgcggco 


cccattgtgt 


tgccgccggt 


caagctgtcg 


420 


agccaaggtg 


acttctcgtt 


ccacgacgcc 


atccagtggc 


cgaagggtgc 


cgcgcagccg 


480 


gtcatcctga 


ttgggcccga 


gaagaacggt 


cagctcgtcg 


cgtggtttgc 


ggcgtcggac 


540 


ttcctcgccg 


actacggcca 


ggcgacgggc 


ataaacancQ 




cacaacacat 


600 


ccagagactc 


ccgtgcggca 


cacccacctc 


gcttcgaaga 


ag 




642 


<210> 
<211> 
<212> 
<213> 


16 

771 

DNA 

Alicyclobacillus sp. 










<220> 
<221> 
<222> 
<223> 


mis cofeature 
(1). .(771) 

CDS 












<220> 
<221> 
<222> 
<223> 


mi sc_feature 
(1) . . (63) 
sig_peptide 












<220> 
<221> 
<222> 
<223> 


misc_feature 
(64).. (771) 
mat_peptide 












<400> 16 
atgaactggg 


cgcgtgtcgg 


cgcgtgggta 


tccacctggc 


tggtggctac 


ggcgcttgga 


60 


gctggctgtg 


ggacggcttc 


gcaagagcat 


ccgtccaaca 


cctccacgtc 


agatcaccgc 


120 


gttgcgcccg 


cggcgccagg 


cggctccgcc 


tcgatgcaaa 


accggcatat 


tctgcaggag 


180 


ccgctgccgc 


gtggcgtgaa 


aacggaaacg 


gatttgtaca 


actggctttt 


atggcagaga 


240 


ctcgccgaga 


tcaacaatcc 


ggcgcagggt 


gaaatctgcc 


tggacgccgc 


atgcaagatt 


300 


gcggccaccg 


tcttttctgg 


cccggccaag 


gccgcggccg 


gcacgcctgt 


cactctggtg 


360 


gcgttttcgc 


cgcgggcggg 


ttggcaggtg 


ctcgtgggtc 


cgctgcccca 


gtcggacaac 


420 


cctccgcgtc 


aagcacaatc 


catcacaggc 


cagtctgcgc 


gactacccgc 


gcaaagaggg 


480 


cgtatgcgtc 


gttcaaaccc 


acgaaatcga 


ctggtactgg 


attcaggacg 


gacacctgca 


540 


gctgatgcgt 


cagccgcgcg 


catgacgcgt 


cagctaaggc 


gatccgccag 


ctcgacgaac 


600 


gcgtcgagat 


cgcgcagggc 


aaagtcgatg 


gcgcgctgcc 


aaaagtcagg 


ttgcgtgaga 


660 


tccgcaccga 


tgtgtttttg 


ggccagatcc 


tcgacccgca 


tgcgaccggt 


gtcgcgaagc 


720 


aacgccacat 


acttgtccgc 


aaatcccgtg 


ccttccgctg 


aggccatggc 


a 


771 



<210> 17 
<211> 3390 
<212> DNA 

<213> Alicyclobacillus sp. 
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<220> 

<221> misc_feature 

<222> (1) . . C3390) 

<223> CDS 

<220> 

<22l> mi sc_feature 

<222> (1) . . (72) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (73).. (3390) 

<223> mat_peptide 

<400> 17 



ttgaaacgca 


cactgagtgg 


cattgcttca 


gctgcaattg 


ttctgggtgc 


gattagcccg 


60 


atggcgtttg 


cgcagacctc 


gtccagcggt 


ctcacgccgg 


ccggtcagtt 


gcctatcgtc 


120 


gtcaatggac 


aggttctgtc 


gaacccgtat 


gagatggtgg 


gcatggactc 


cggcaacaag 


180 


acgggcttct 


tcccgattta 


ctactttgac 


caggcgcttg 


aaaagattgg 


catcacggcg 


240 


acctggaatg 


gtgcaaccca 


cacctgggcg 


ctgacggact 


ccaacgtcaa 


tgcttcgaac 


300 


gtccaagtcg 


cgggtggtat 


gggcacgggg 


aacaccacgg 


tgaccctgaa 


cggcacgccg 


360 


attaagatgt 


tctacaccca 


ggttgcgaag 


gacccggcgg 


gtggcccggt 


cacgacgtat 


420 


atgccgattt 


actatatcaa 


caacatcctg 


agtgcgcttg 


ggatccatgg 


aacctttagc 


480 


ggacagacgg 


gtctcaacat 


taccaccggg 


cagacgcttg 


ccggtagcct 


gagtgccatc 


540 


acggtgacgg 


gggcgacgag 


cggtacgggg 


acctcttcga 


gcccggctgt 


ggcgttgaat 


600 


aacggcaagg 


ttacgctctc 


gacgactctg 


acggattcga 


atggcaatcc 


gattggcaac 


660 


gcggcggtca 


ccttcaactt 


ctctgaatat 


ggtgcgctgc 


cttcgaatgc 


gccgacggtc 


720 


accaatgcgt 


cgggtgcgac 


aattccggcg 


accaccggct 


cgacggctta 


tcagtacacg 


780 


gtctacacca 


actccagcgg 


tgtggcttcg 


atcacggtgt 


ctgggcccgt 


tggcttgacc 


840 


tacgcatacc 


aggtgactgc 


gacggcgccg 


atcagcaatg 


gcagcaatca 


aatgattagc 


900 


agccagccgg 


cgtatgtcga 


gtttgtcgcc 


aacaaccagg 


cgggtattgc 


gccgtacggc 


960 


acggcttctc 


aaccgtactc 


ggcttcgctg 


ggtaccgcag 


ttcccarcac 


ggtgattttg 


1020 


ccgccgggtg 


cgaacggtca gccgcaggcg 


aatgtgctcg 


tgaccctgtc 


gctgagcaac 


1080 


ccgaatggtg 


gcaccaacta 


tgcatacttc 


accaactcgt 


cgggtgcgaa 


tctgggcacg 


1140 


caaatccagg 


tgacgaccaa 


ctcgtcgggt 


gtggcgcaag 


cgtgggtcag 


cgacgcgaac 


1200 


gcgcagcctg 


ttgtcgtgac 


ggccaatgtg 


tcgaatgcga 


ccaatgtcag 


caacacttcg 


1260 


gtgagcacct 


acctgaactt 


tggtcaggca 


ggcgtgccag 


catcgatcgc 


caattacaac 


1320 


gatccgtatt 


cggctttggt 


ggccaacggt 


cagcagccgc 


tcgccggtac 


gacggtgacg 


1380 


attacgggta 


cgctcgtaga 


cgctgcaggc 


aacccggtgg 


ccaacggtca 


ggtgctrgta 


1440 


accggctcgt 


cgtccagcgg 


cgacttcggc 


tatgtcacga 


cgtccaacgg 


caagagcacg 


1500 
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acgaccgact 


tcccgagcgt 


gggtacgttg 


cagcctggtc 


agcctgtgag 


ctccgcgctg 


1560 


ggtgacgtca 


tcacggcgga 


tgcgaacggc 


aacttctcgt 


tgcaagtcac 


agacacgcag 


1620 


aacgagcaag 


ccagcctgac 


gttctactcg 


gtgagcaacg 


gggtcattag 


cccggtgggg 


1 C O f\ 

1680 


gtcattaaga 


ccgacacgct 


gaaattcgca 


gtgaacaatc 


agctgtcgac 


cattgcgctg 


1740 


ggtgcgacgg 


acgctcaagc 


ggacggcaac 


cagtacacga 


atctgacggg 


tctcacgggt 


1800 


tcggacaatg 


cgccggtgcc 


ggtgtatgtg 


gatccgcaga 


atccgtcggg 


cacaatggtg 


1860 


accaatcaga 


gcatcaccta 


tacgctcagc 


gtcagcagcg 


gcgacatcgt 


gggcattggc 


1920 


tctggtgcgt 


atctggcgcc 


gaccaatgcg 


aacaacagca 


cgattccgat 


caacagcggc 


1980 


aacggcctca 


gctccgtcca 


ggtcacggtc 


acggcattgg 


gcaacaacca 


ataccagatc 


2040 


tcggtgcccg 


gtcagcaagg 


cgtgttgacg 


acctcgtcgc 


ctgactttac 


ggtgctggtg 


2100 


aaaggctcga 


cgggttcgac 


gaagctgacg 


gtcagctccg 


gctcactctc 


gtcgacggca 


2160 


accatcacct 


tcacgtcgag 


caacccgacg 


gtggtggcta 


gcctgacgcc 


agtttcctcg 


2220 


gtgttggcgg 


ctggtcagaa 


cgagacggtc 


accttcaccg 


tggaagatgc 


agatggcaat 


2280 


ccggtgagcg 


gtaatacgca 


ggttgccatc 


acggcgcatg 


acagcaatga 


tccgttgtgg 


2340 


atcaccgcag 


tgaatggcac 


aaacttgagc 


gagtatgaga 


cgattaatgg 


tgctgcaacg 


2400 


tctgtcagca 


cgccgattcc 


gctcggtacg 


agttcgtatg 


caacctctgg 


tggttctacg 


2460 


ctctacccgg 


cttacacgaa 


cagcgggtac 


tttaagaatg 


gtgtgagcat 


cagcggtgtc 


2520 


gtatcgtggg 


atggtacggt 


gggcgatcca 


atctacgtca 


ccaccaactc 


gcaaggccaa 


2580 


gtcacgctga 


ccttgcaaaa 


cggcaacgtg 


acctattttg 


acggaaacaa 


caccacgttg 


2640 


tcgaatggca 


tcagcgttgc 


cggtacgagc 


ggaagtgaag 


ggttctacac 


atattcgagc 


2700 


gataccgcag 


cgacagcgtc 


ggatcttaca 


aatatgggcg 


tgttggtcat 


tggtcaagcc 


2760 


aatggtgacg 


cttcaacgtc 


gctcggaacg 


atttacatcg 


gcagtggtgg 


tgctacgcag 


2820 


acaccggccg 


ccttcaccta 


cgtggatgcc 


aataaccact 


cttacacgta 


ctcgaacacg 


2880 


agcgatacat 


ttacggtatc 


tagcacccag 


agtgttagcg 


gtggcaacta 


tgcgatcaca 


2940 


agcttcacgc 


cagttggagg 


tactgcaact 


tctacaatcc 


cgagtggcgt 


gagcgtaaat 


3000 


agctcgacgg 


gtacggtttc 


ggtgtcccaa 


aacgctgcag 


tcggtacgta 


caccgtgagc 


3060 


tattacctga 


acggcgtcac 


tgaatccact 


ggcacgttca 


aggtgtactc 


cggcagcggt 


3120 


gtggctccta 


cagagatcac 


tggctcgtca 


gtgacggttc 


ctgctgcaac 


gtactcgggt 


3180 


acgttgaaag 


tcacggtaag 


caacggtggt 


tcgccgctgt 


acgtgaacgt 


taccgctgga 


3240 


gaatcggcca 


atgcggtggc 


tgcagctatt 


tacaacgcgc 


ttgtcaatgc 


caatatcagc 


3300 


ggagatacct 


tctctgtttc 


gggttcgaca 


gtcagcgtga 


ccgctgcgag 


cggttcgccc 


3360 


acgctcacag 


ttgtcgatgc 


gaccaatttc 








3390 



<210> 18 
<211> 744 
<212> DNA 
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<213> Alicyclobacillus sp. 



10406. ST2 5 



<220> 

<221> misc_feature 

<222> (1)..(744) 

<223> CDS 

<220> 

<221> mi sc_f eature 

<222> (1). .(123) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (124). .(744) 

<223> mat_peptide 



<400> 18 
gtgcgaatta 


tgaaagtttt 


gggatggatt 


ttggtaccgt 


atatcatgct 


gtttattcag 


60 


tgggggcgaa 


tgaacagaat 


tctgcgtttt 


gccggttcat 


tgtgggcatt 


aattgtcttc 


120 


gcgaacacgg 


tgtatatgat 


tcgaggaaac 


acaccgcgga 


acgcatcaac 


ggtaagcgct 


180 


acaacttctt 


tggttaattc 


gacgaatagt 


tcacaggtag 


caaagcaaga 


gcaaaactcg 


240 


agtacgtctc 


ccgctcataa 


gtctacgaac 


tcattgcaac 


atgcgcaaca 


tcaagctgct 


300 


acgacttcat 


cttctcagtc 


gaagttacga 


tatatcccgt 


ttcacacata 


cgggaaggta 


360 


ggagacttgg 


aaattagagt 


taactccctg 


cagcaagtta 


agagtgtggg 


gtacgacggg 


420 


ataggtgaaa 


ccgcaaatgg 


tgcgttttgg 


gttatcaaca 


tcaccataag 


aaatgacgga 


480 


tccactccta 


tggaggtcgt 


tgatggcata 


ttccatttgc 


agaacttaaa 


cgggaacgtt 


540 


tatcagccgg 


attctactgc 


tgagatatat 


gcaaatacaa 


attcagggac 


tattccgacc 


600 


gacctcaacc 


ctggtgtgtc 


catgacgaca 


aatctcgtat 


ttgatatgcc 


ggattttatg 


660 


acatatggtc 


acgtcgggca 


gcattactca 


cttgtcgctt 


ccatgggttt 


cttcgggtca 


720 


gatgaaacga 


cgtatgctct 


tccg 








744 



<210> 19 

<211> 516 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_feature 

<222> (1)..(516) 

<223> CDS 

<220> 

<2 2 1> mi sc_f eatur e 

<222> (1). .(75) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (76).. (516) 

<223> mat_peptide 

<400> 19 
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atgaaccgca 


aatccatgtt 


gtctgtgttg 


ggtgtggcag 


ccgcagtagc 


cctgatggtg 


60 


acgggctgtg 


gcacggccaa 


cagcacgaac 


aacacggcgt 


cgagcggtgc 


ggccagcaca 


±ZU 


gccgtcacgg 


tgaagcacga 


gcacaagggg 


gccaatgctt 


cgaagacaga 


gacgaagcag 


1 OA 


accgaagcga 


agtcgtcgaa 


caaggctgga 


gaaacggcga 


agtcgtcggt 


gaagctcacg 




gccccggtgg 


caggcgcgac 


ggtgacggcc 


ggcggcacgc 


tgaaggtgag 


cggccaagtg 


300 


tcgtcgaacc 


tcgcgaagaa 


ggacgtgcaa 


attacgttga 


caaatagcgc 


gaagaaggtg 


360 


ctcgtgcagc 


agatcgtcgg 


tacgaatagc 


accggcgcat 


tcgtggacac 


gctcaagctt 


420 


ccaaagtacc 


ttgggaaagc 


cggaagcgac 


ctgacgctgt 


cggtgtccgt 


cgttggcgaa 


480 


aatggagtcg 


taagcacctt 


gtcgctgcac 


gtgaag 






516 



<210> 20 

<211> 726 

<212> DNA 

<213> Mi cyclobacillus sp. 



<220> 

<221> mi sc_f eature 

<222> CD.- (726) 

<223> CDS 

<220> 

<221> mi sc_f eature 

<222> CD ■ . (90) 

<223> sig__peptide 

<220> 

<221> mi sc_feature 

<222> (91) . .(726) 

<22 3> mat_peptide 

<400> 20 



atgaggcgcg 


cggttcgtat 


actagctgcg 


ctactgtttg 


ggctggcgac 


ggtaacagcc 


60 


acattgatgt 


tcgtgcctca 


ggcaagagcg 


gccacggtga caggagcgtt ggcgcaatcg 


120 


caagtggtgt 


ccattacggg 


cggctacaac 


acgacgacac 


agatgtatga 


gcagacgggt 


180 


cagcaaaccg 


tcgttacgaa 


ttggaccttt 


tctcttcaac 


aaactgtcaa 


ccaaaacaac 


240 


gagaatccgt 


cctacgctca 


atgcacagtc 


ttggcgggaa 


accagcaggt 


aacgtgcacg 


300 


tcggacgcta 


cgaataacgg 


tgcaatttgc 


acatccccct 


atcctggagc 


tattgacaag 


360 


caatgcacga 


acctgattgg 


gttcactgga 


aacatatcag 


tgagttcgca 


aaacggcaat 


420 


ccaacgttca 


ctttttctct 


tccgagcatc 


gacccgagta 


ccatgaagcc 


agttgggatc 


480 


tttgtgacgc 


ctgagacgat 


ctatggtcag 


atgggaacag 


ggtccgaaag 


ttatttaagc 


540 


tcaggtcaat 


ctggaggatg 


gtcatttaac 


ttttccaacg 


tctcagatcc 


tcaagattgg 


600 


tattttctcc 


ttgagttttt 


ggcgaatcca 


attgtcgcgg 


ccattgctgt 


gcccaccact 


660 


caaacggttc 


cgatttatag 


ctgggtcacc 


accacggttt 


ggcaccccgt 


tcaaatttcc 


720 


tacagc 












726 
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<210> 21 
<211> 540 
<212> ONA 

<213> Alicyclobacillus sp. 



<220> 

<221> mi sc_feature 

<222> (1) . . (540) 

<223> CDS 

<220> 

<221> mi sc_feature 

<222> (1)..(72) 

<223> sig_peptide 

<220> 

<221> misc_feature 

<222> (73). .(540) 

<22 3> mat_pept i de 



<400> 21 
gtggttcgga 


tgcgcaagcg 


gttgggactt 


gttctgagta 


tggtgacatc 


tgtgttggtt 


60 


ggatgtggcg 


cttcacatcc 


gtctccattg 


aaccaagaca 


aatctttgtt 


gacgtggaac 


120 


gctgctaaac 


acgaggtgcg 


gtggaaagtg 


gtcgccggcg 


acggacgcgc 


aaacggcggt 


180 


atgaacttcg 


atggctatgc 


caatggcagt 


atgacactgg 


tcgtgccgat 


tgggtggcgc 


240 


gtcgtgatcg 


actttgacaa 


tgccagtttg 


atgccgcaca 


gcgcgatggt 


ggtgccttac 


300 


ggagatcgcg 


aacgctccaa 


cttcgacgca 


acgatggttg 


cgtttccagg 


cgcagaaacg 


360 


cccaatccgt 


cacagggaga 


ccctcaaggg 


acgcatcggg 


atgtcatctt 


cactgctgcg 


420 


aaggtgggaa 


cgtatgccct 


cgtctgcggg 


gtcccgggtc 


acgcgctggc 


gggaatgtgg 


480 


gatcagcttg 


tggtgtccga 


tgaagcgaaa 


cacccgtccc 


ttcgcgtgca 


acgcgactca 


540 


<210> 
<211> 
<212> 
<213> 


22 

1431 
DNA 

Al i cycl obaci 11 us s p . 










<220> 
<221> 
<222> 
<223> 


mi sc 
(1). 

CDS 


,_feature 
.(1431) 












<220> 
<221> 
<222> 
<223> 


mis cofeature 
(D.-C75) 
sig_pepti de 












<220> 
<221> 
<222> 
<223> 


misc_feature 
(76). .(1431) 
mat_peptide 












<400> 22 
atggcggttc 


gtagagcgtg 


gcttctggcg 


cccttgtgcg 


cgagcagtct 


ggtcgtcccg 


60 


gcctcggtgc 


aggccggatt 


ggcccaggga 


catggcagct 


tttcgacggt 


tcgcgtgtcc 


120 


gtggggacgt 


cgagttccct 


gtccgtcccc 


gcgctgattc 
Page 21 


agggaaacga 


aacgtacatt 


180 
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ccgctgtggg 


acctcatgca 


ggtgctccat 


cagctcggct 


tcaccgcgac 


gtgggcgaag 


240 


ggccaattca 


gcgtttcggc 


cccgccatcg 


gtgccgatgg 


acgaggcgcc 


tgggccagcg 


300 


ggcaaaggcg 


gggcgctcgt 


ggtgctcgac 


gggcaagtcg 


tggaacaggt 


gccgacggtc 


360 


atcgccacgc 


caccgggggc 


ggccacccct 


gaggtgtttc 


tgccgctcac 


gaacgcggag 


420 


gagatcctcg 


gtcggttggg 


cattcaggcc 


agcgcgaccg 


gcaatcaggt 


gaacctcgac 


480 


gcgtcggctg 


tgccccaggc 


gcttcccaac 


cagcaggtgg 


ctgtgtggaa 


cgtgcttgcc 


540 


gctgttgcgt 


ccgatctcgg 


cgtgtcgacc 


gcgccagccg 


ggccgagtcc 


ctacgccgac 


600 


ttgccgacag 


cctcgccggc 


gtggggcgcg 


gtggaggcgg 


ccattcgtct 


gggctggtat 


660 


tcgcccttat 


ccgcgtcgtc 


atccggcgcg 


tttcaaccca 


tcacgtgggc 


gcaaacggca 


720 


tccattctgt 


ggaatgcgct 


cggcatttca 


cagcaggacg 


cggcgtacca 


gccaggcgga 


780 


tcgccgacgg 


cgtgggcgag 


cgcccttggc 


cttgttccag 


aaaactggga 


tccagcgtcg 


840 


tacatgaccg 


cgcaggaatt 


ggacaccttg 


gcgtcgaatt 


tgcacgaatg 


tctgcaagga 


900 


gatgtcgaaa 


cgggcgccaa 


cacgtggcgg 


ctctggtatc 


cgccggctga 


cgaagtggag 


960 


gctaccctcc 


agtcgggagg 


cgggcagtcg 


ctgttcacct 


cgaccgctga 


cgcgcaggcc 


1020 


gccatctcgt 


cagcctacca 


attcttcaat 


cagcttgtgg 


tcacaagagt 


cggccaaggg 


1080 


tatgtcgtca 


ccgttccctc 


tgtgcctgag 


ggatatgggt 


ttgccacctt 


ttctgcgctc 


1140 


ggcggtgtgg 


cttaccagac 


gacacccggc 


ggtccgtgga 


cggtcgtgcc 


cgtgctggac 


1200 


acgcgcgacg 


tctccatccc 


ggccaagggc 


cgtctcagtg 


tcaaggttcc 


cgcgcagggc 


1260 


atcaccatca 


cgtggaatca 


gatgatgcca 


tcgctgggcg 


gaacggtggc 


catgggcgcg 


1320 


ctccaggtgt 


cgcctggacc 


cagcgggcct 


tcggtcgagc 


gcttgaatat 


cgtcacaccg 


1380 


aacttacctc 


cggtccttcc 


gtcgtccgtc 


acttctacgc 


aaccgcagtc 


a 


1431 



<210> 23 

<211> 1020 

<212> DNA 

<213> Alicyclobacillus sp. 
<220> 

<221> misc_f eature 

<222> (1)..(1020) 

<223> CDS 

<220> 

<221> mi sc_f eature 

<222> (1)..(57) 

<223> sig_peptide 

<220> 

<221> mi sc_f eature 

<222> (58). . (1020) 

<223> mat_peptide 

<400> 23 

gtgaatcgac agtggaggct agcggtggcg acttctgccg tcgcggccag cctcgcgggg 60 
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tgtggagcac 


cggacctcgc 


ggcgatgcgg 


10406. ST2 5 
ccgacggtcc aaaagtctgc 


ggtactcgtg 


120 


gaggtcgtgg 


gcgcgccgcc 


gtttgcgccc 


tcagcttcac 


aactqqqaac 


ggcaggggcc 


180 


acctccgtcg 


aggtggttca 


cgttgccctt 


gqcqaatqqc 


agtctgtcgc 


ggcccacgca 


240 


ttggcgaagg 


ggcaattgac 


aggggtcatg 


OtCQtQtaCQ 


acgacgcgaa 


cgccgtcgcg 


300 


tctggcctca 


accaacttgc 


tgccgaccat 


cccgacgttc 


gctttctcgt 


ggtcagcaac 


360 


tggccggctt 


cgcaaatcac 


ctccggaaac 


qtqaaaaaca 


tcgcacagga 


tcctgtggcc 


420 


gtcgcttaca 


gcattggcgc 


gctgtgcgga 


gactggatcg 


cgagctcaac 


gtcgacgagc 


480 


ggagcggtat 


acagcggcgt 


gcccagcatc 


atctiacacoc 

y t- v. *- m. v. y v» y 


cacacaatoc 


gaccgtggct 


540 


gaacaaaaag 


ccttcttcac 


gggtctgtat 




rcaat'at'cca 


ggtcgtcgcg 


600 


cttccgcagc 


ccgctgcgca 


gagcctgtcg 




acacQataaa 
a y yy ^yy 


tttgggtgtg 


660 


gtaggcgggt 


ctcctgcggc 


aggggaactg 


tcggcgcttc 


gcagtgccgc 


ccccgcctgg 


720 


gctgcttttg 


gaacgtcgcc 


gatcgctggc 


tttgcgattt 


ctcctggcca 


tctgtcgtcg 


780 


tcggaggccg 


tgcaagcatt 


ccaggcgctc 


gtgtcgccgg 


acgcgtggca 


ctcgggtgag 


840 


catctcgtgc 


tcgacttgtc 


ttcggtggcc 


ttcgacgaca 


agcaggtgcc 


cgcgaccgtc 


900 


atcgcggcgt 


gggccaagct 


ggaggtcaac 


gcgatcgcgg 


ctgcagcgca 


atcgaacgcg 


960 


gccttcgcgt 


cactgccgcc 


gagcgtgcgc 


tcggacctcg 


ccaatgcgtt 


tcatttgtca 


1020 


<210> 
<211> 
<212> 
<213> 


24 

1023 

DNA 

AlicyclobacilTus sp. 










<220> 
<221> 
<222> 
<223> 


mi sc_feature 
CI).. (1023) 
CDS 












<220> 
<221> 
<222> 
<223> 


misc_feature 

CD-.C87) 

sig_peptide 












<220> 
<221> 
<222> 
<223> 


misc_feature 
C88) . . (1023) 
mat_peptide 












<400> 24 
atggtcatgc 


gcactcggtg 


gattcgatgg 


at aactttoa 




ctgtgtctgg 


60 


ctcagcccgt 


ttcccttctc 


gtggggcgcg 


acgagcctcg 


acgctgatct 


tccacaaccc 


120 


acgattccgc 


catccgcgtg 


gagcaacctc 


aatcaggact 


ggaaggacct 


tcagcgcttg 


180 


gcgcaaaaca 


cagtgccgcc 


ctcgaaagag 


agcagccaga 


cccacgcgcc 


cacacacaag 


240 


tcatcgcaac 


cgcctgccca 


agtcccgcaa 


gggccgctcg 


tcggggtcgg 


cgatacgggc 


300 


gaagcggccc 


ggtggttaaa 


cgaagccttg 


gccgtgctcg 


gctatttgcc 


cgccgtcttc 


360 


tctcccgcgg 


cgcagacgtc 


cacccgtcag 


gtgcggctcg 
Page 23 


cactcgcggc 


gagcgccgag 


420 



10406. ST2 5 



catcagacgc 


tcgtgcccat 


cccaggctcg 


tttcaacttc tgtatcacgc 


gccaagctcg 


480 


tgggtggcgc 


tctggtccgc 


cgacgaagac 


acgccgatca 


cggagggcgc 


cgtcatggcg 


540 


tttgaagcac 


aacatcacct 


gggcgtggat 


ggcattgccg 


ggccggacgt 


cattcatgcg 


600 


ctggcgcagg 


ccctcgccgg 


caatgagacg 


gcagaaaagg 


cgccctacag 


ctacatcctg 


660 


gtgaccacgt 


cgttgcccga 


gacgctcgaa 


ctctgggtga 


atggccagct 


tgtcctcaaa 


720 


tcgctgtgca 


acacaggcat 


cgcgcagtca 


cccacgccgt 


atggcacgta 


cggcgtctac 


780 


gtgcagtaca 


cgtcgcagga 


aatgaagggc 


aaggatccgg 


acggcacgcc 


ctacgacgat 


840 


cccggcgttc 


catgggtgag 


ctacttctac 


aaaggttgcg 


cggtccacgg 


tttcctgcgg 


900 


gcaaagtacg 


gctrtcccca 


gagcctcggt 


tgcgtggaac tgccgtatgc 


cgcggccaaa 


960 


acggtgttct 


cctatacgca 


catcggcacg 


cttgtcaccg 


tcaccgcctc 


cccgctttcc 


1020 


gcg 












1023 



<210> 25 

<211> 1197 

<212> DNA 

<213> Alicyclobacillus sp. 



<220> 

<221> misc_f eature 

<222> (D..CH97) 

<223> CDS 

<220> 

<221> mi sc_f eatu re 

<222> CI)- . (84) 

<223> sig_peptide 

<220> 

<221> mi sc_f eature 

<222> (85). .(1197) 

<223> mat_peptide 

<400> 25 



atggataggc 


tgctgaacaa 


caaggtggcg 


cttcgcctga 


ccgcgctcgt 


cctcgcgtgc 


60 


attctctggc 


tcgccgtgca 


cgcggagcag 


gggtcggggt 


cctccgcgtc 


cacgggagtg 


120 


accgagtcgt 


tcgagctgcc 


ggtgcgggtg 


gaaacctcgg 


ccgacgaggt 


gttggtgtct 


180 


caagttccga 


ccatcaccgc 


ccgggtgacg 


acgaacctgt 


tgagcctgcc 


gacgctggcc 


240 


tcggatatga 


tgaaagccga 


gatcgtcgcg 


gacgccgaaa 


atctgggccc 


gggcacgtac 


300 


acgttgcacg 


tggcggccgt 


caacatgcct 


gcaggggtgc 


gatcgtacac 


gctaacgcct 


360 


tccaccatca 


cggtgacgtt 


ggagcccaaa 


gtgacggtgg 


agcgaacggt 


gcgggtgaac 


420 


gtggtcggca 


cgccagggca 


gggatatgtc 


ctcggcaagc 


ccgagctcgg 


cgcgggggtc 


480 


gtcgaggtct 


cgggcgccga 


atccagtgtg 


caggccgtgg 


ccgaggtggc 


gggcgtcgtg 


540 


gacgcgagcg 


gcctgtcgca 


gacggcgacc 


aagctcgtcg 


agttgttgcc 


gcttgaccaa 


600 


gcgggcaagg 


cggtgccggg 


tgtgacggtc 


acgccatccg 


cgatttcggt 


cacgctgccg 


660 
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dTcacgxccg 


ccaa xcaggc 


ggxgaagcxg 


acgccxgcyy 




rrrtnrorrt 


720 


gga xacgccg 


tcgcc teg g l 


y cacc xygag 


cccyv.ydy^y 


r t* n "t* rt n a a a 


yyyy<- L<i y*-y 


780 


ft t~ f~ ^ r\ r t~ r\ f~ , 




tyyyt LV.V.LV. 


y Lyt-Lua l v_y 






840 


cccacgacy y 




nnt'nrrnrt't 

gg LgLLgc l x 


L Lgt-t-y yy y d 


l y dcy ay i_y i_ 




900 


gcagtgacgg 


ccgtgatcga 


cgtggagccg 


tccgccgtct 


acaccgtttc 


gaacgtcccg 


960 


gtggccatca 


cgggcgcgac 


gggtgtcaag 


ctggtgacgc 


ctcggaccgt 


gaatgtcacg 


1020 


gtgacgggga 


tcgaggccga 


cgtgcgcgcg 


gtggagaggg 


atccggccgc 


ggtgcaggcg 


1080 


tttgtggacg 


cgaccgggtt 


gacacatggc 


tcggcgacgc 


tgcccgattc 


aaattcgtct 


1140 


gctgtcctgt 


ctcttgtgat 


ccggccacgg 


gaaaggcgta 


agcgaacaca 


tgtagtg 


1197 



<210> 26 

<211> 959 

<212> PRT 

<213> Alicyclobacillus sp, 
<220> 

<221> SIGNAL 

<222> CD-.C24) 

<220> 

<221> mat_peptide 
<222> (25). .(959) 

<223> acid endoglucanase or acid cellulase 
<400> 26 

Met Lys Thr Arg Trp ser Gly Ala Leu Ala val Leu lie Ala Leu Gly 
-20 -15 -10 

Thr Gly Ala ser Pro Ala Trp Ala Ser val His ser Ala Ala Thr His 
-5 -11 5 

Ala Lys Ala His val Gly val Arg Ala Ala Asp Met Ala Ala Ala Ser 
10 15 " 20 

Met Ser Ala Glu lie Gin lie Leu His Asp Ala Leu Thr Ala Ser Glu 
25 30 35 40 

Leu ser Ser val Gin Ala Ala Ala Gin Ala Ala Ala Asn Leu Pro Ala 
45 50 55 

Ser Thr Trp Val Ser Trp Leu Tyr Pro Ser Ala Ser Ser Pro Ser Ala 
60 65 70 

Ala Gin Thr Gin Thr Ala Gin Ala Leu Gly Ala Leu Leu Thr Leu val 
75 80 85 

Thr Tyr Gly Ala val Ala Asp Asp Gly Gin Asn lie Ala Gin Asn Leu 
90 95 100 
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Gin Thr Leu Gin Ser Thr Ser Pro Leu Leu Ser Pro Ala Ala val Ser 
105 110 115 120 

Met Phe Tyr Gin Asn Phe Phe Val Leu Val Gly Gin Ser Ser Lys Ser 
125 130 135 

val Leu Ser Gly Gin Ala Thr Thr ser Thr Ala Gly His Ala Leu Ala 
140 145 150 

Gin Ala Ala Ala Leu Thr Pro Gin Leu Ala Ala Tyr Leu Arg Gin Ser 
155 160 165 

Gly Leu ser Pro Asp Asp Leu Ala Arg Ala Tyr val ser Phe Ala Ser 
170 175 180 

Ala val Asp Ser Gin Gly Ala Ala Gin Thr Ala Leu Leu Thr Arg lie 
185 190 195 200 

Cys Thr Asn lie Leu Gly Phe Gly Ala Pro Thr Ser Thr Ala Thr He 
205 210 215 

Thr val Asn Ala Ala Ala Asn Leu Gly Gin val Pro Thr Thr Ala Phe 
220 225 230 

Gly Leu Asn Ala Ala val Trp Asp ser Gly Leu Asn ser Gin Thr val 
235 240 245 

He Ser Glu val Gin Ala Leu His Pro Ala Leu He Arg Trp Pro Gly 
250 255 260 

Gly Ser lie Ser Asp val Tyr Asn Trp Glu Thr Asn Thr Arg Asn Asp 
265 270 275 280 

Gly Gly Tyr val Asn Pro Asp Asp Thr Phe Asp His Phe Met Gin Phe 
285 290 295 

Val Asn Ala Val Gly Ser Thr Pro He lie Thr Val Asn Tyr Gly Thr 
300 305 310 

Gly Thr Pro Gin Leu Ala Ala Asp Trp val Lys Tyr Ala Asp val Thr 
315 320 325 

His His Asp Asn val Met Tyr Trp Glu lie Gly Asn Glu lie Tyr Gly 
330 335 340 

Asn Gly Tyr Tyr Asn Gly Asn Gly Trp Glu Ala Asp Asp His Ala Val 
345 350 355 360 

Ala Gly Gin pro Gin Lys Gly Asn Pro Gly Leu Ser pro Gin Ala Tyr 
365 370 375 
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Ala Gin Asn Ala Leu Gin Phe lie Lys Ala Met Arg Ala Glu Asp Pro 
380 385 390 

ser lie Lys lie Gly Ala val Leu Thr Met Pro Tyr Asn Trp Pro Trp 
395 400 405 

Gly Ala Thr val Asn Gly Asn Asp Asp Trp Asn Thr val val Leu Lys 
410 415 420 

Ala Leu Gly Pro Tyr lie Asp Phe val Asp val His Trp Tyr Pro Glu 
425 430 435 440 

Thr Pro Gly Gin Glu Thr Asp Ala Gly Leu Leu Ala Asp Thr Asp Gin 
445 450 455 

lie Pro Ala Met val Ala Glu Leu Lys Arg Glu Val Asn Thr Tyr Ala 
460 465 470 

Gly Ser Asn Ala Lys Asn lie Gin lie Phe Val Thr Glu Thr Asn Ser 
475 480 485 

val Ser Tyr Asn Pro Gly Glu Gin Ser Thr Asn Leu Pro Glu Ala Leu 
490 495 500 

Phe Leu Ala Asp Asp Leu Thr Gly Phe lie Gin Ala Gly Ala Ala Asn 
505 510 515 520 

val Asp Trp Trp Asp Leu Phe Asn Gly Ala Glu Asp Asn Tyr Thr ser 
525 530 535 

Pro ser Leu Tyr Gly Gin Asn Leu Phe Gly Asp Tyr Gly Leu Leu Ser 
540 545 550 

Ser Gly Gin Thr Thr Gin Asn Gly Trp Gin Glu Pro Pro Ala Asn Thr 
555 560 565 

Pro Leu pro Pro Tyr Asn Gly Phe Gin Leu Val Ser Asp Phe Ala Gin 
570 575 580 

Pro Gly Asp Thr Met Leu Gly Ser Thr Thr ser Gin Ser Ala lie Asp 
585 590 595 600 

val His Ala val Arg Lys Pro Asn Gly Asp lie ser Leu Met Leu val 
605 610 615 

Asn Arg Ser Pro ser Ala lie Tyr Ser Ala Asn Leu Asn Val Leu Gly 
620 625 630 

Phe Gly Pro Phe val Val Thr His Ala Leu Ala Tyr Gly Glu Gly Ser 
635 640 645 
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Ser Arg val Ala Pro Met Pro val Leu Pro val Pro Gly Ala Pro He 
6S0 655 660 

Lys Leu Met Pro Tyr Ser Gly lie Asp Leu Thr Leu His Pro Leu lie 
665 670 675 680 

pro Ala Pro His Ala Ala Ala Gin val Thr Asp Thr Leu Thr Leu Ser 
685 690 695 

Ser pro Thr val Thr Ala Gly Gly Ala Glu Thr Leu Ser Ala Ser Phe 
700 705 710 

Gin Ala Asp Arg Pro Val His His Ala Thr val Glu Leu Glu Leu Tyr 
715 720 725 

Asp Ser Thr Asn Asp Leu val Ala Thr His Thr val Ser Asp val Asp 
730 735 740 

Leu Gin Pro Gly Ser Ala Thr Ser Glu Thr Trp Ser Phe Thr Ala Pro 
745 750 755 760 

Ala Ala Asn Gly Asn Tyr Arg val Glu Ala Phe val Phe Asp Pro val 
765 ^ 770 775 

Thr Gly Ala Thr Tyr Asp Ala Asp Thr Gin Gly Ala Val Leu Thr val 
780 785 790 

Asn Gin Pro Pro Gin Ala Thr Tyr Gly Asp He Val Thr Lys Asp Thr 
795 800 805 

val lie Thr Val Asn Gly Thr Thr Tyr Asp val Pro Ala Pro Asp Ala 
810 815 820 

Gly Gly His Tyr Pro Ser Gly Thr Asn He Ser Val Ala Pro Gly Asp 
825 830 835 840 

Thr val Thr val Gin Thr Thr Phe Val Asn val Ser Ser Thr Asp Ala 
845 850 855 

Leu Gin Asn Gly Leu lie Asp Met Glu val Asp Gly Ser Asn Gly Ala 
860 865 870 

Xle Leu Gin Lys Tyr Trp pro Ser Thr Thr Leu Leu Pro Gly Gin Ser 
875 880 885 

Glu Thr Val Thr Ala Thr Trp Gin Val Pro Ala Asn Val Ala Ala Gly 
890 895 900 

Thr Tyr Pro Leu Asn Phe Gin Ala Phe Asn Thr Ser Ser Trp Thr Gly 
905 910 915 920 
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Asn cys Tyr Phe Thr Asn Gly Gly val val Asn Phe val lie Ser 
925 930 935 

<210> 27 
<211> 272 
<212> PRT 

<213> Alicycl ©bacillus sp. 
<220> 

<221> SIGNAL 
<222> CD-. (32) 

<220> 

<2 21> mat_pepti de 

<222> (33).. (272) 

<223> aspartyl protease 

<400> 27 

Met Asn Gly Thr ser val Trp Lys Ala ser Gly lie Ala Ala Ala Ser 
-30 -25 -20 

Cys Leu Thr Ala Ala Ala Leu Leu Ala Trp Pro His Ala Thr Ser Thr 

-15 -10 -5 -1 

Leu Asp Ala ser Pro Ala lie Phe His Ala Pro Arg His Ala Leu Ser 
1 5 10 15 

Pro Asn Thr Ser Pro Lys pro Asn Ser val Gin Ala Gin Asn Phe Gly 
20 25 30 

Trp ser Ala Ser Asn Trp Ser Gly Tyr Ala val Thr Gly Ser Thr Tyr 
35 40 45 

Asn Asp lie Thr Gly ser Trp lie val Pro Ala val ser Pro Ser Lys 
50 55 60 

Arg Ser Thr Tyr Ser ser ser Trp lie Gly lie Asp Gly Phe Asn Asn 
65 70 75 80 

ser Asp Leu lie Gin Thr Gly Thr Glu Gin Asp Tyr val Asn Gly His 
85 90 95 

Ala Gin Tyr Asp Ala Trp Trp Glu lie Leu Pro Ala Pro Glu Thr Val 
100 105 110 

lie Ser Asn Met Thr lie Ala Pro Gly Asp Arg Met Ser Ala His lie 
115 120 125 

His Asn Asn Gly Asn Gly Thr Trp Thr lie Thr Leu Thr Asp val Thr 
130 135 140 

Arg Asn Glu Thr Phe Ser Thr Thr Gin Ser Tyr Ser Gly Pro Gly Ser 
145 150 155 160 
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Ser Ala Glu Trp lie Gin Glu Ala Pro Glu Ile Gly Gly Arg lie Ala 
165 170 175 

Thr Leu Ala Asn Tyr Gly Glu Thr Thr Phe Asp Pro Gly Thr val Asn 
180 185 190 

Gly Gly Asn Pro Gly Phe Thr Leu Ser Asp Ala Gly Tyr Met val Gin 
195 200 205 

Asn Asn Ala Val Val Ser val Pro Ser Ala Pro Asp Ser Asp Thr Asp 
210 215 220 

Gly Phe Asn val Ala Tyr Gly ser Asn Gin Pro ser Pro Pro Ala Ser 
225 230 235 240 

<210> 28 

<211> 315 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CI) . . (25) 

<220> 

<221> mat_peptide 

<222> (26).. (315) 

<223> multi copper oxidase 

<400> 28 

Met Arg Arg Arg Met Ser Gly Phe Ala Thr Gly Leu Gly Ile Ala Ala 
-25 " ~ -20 -15 -10 

Gly Leu Ala Leu Ser Ser Ala Leu Ala Ala Pro Phe Phe His Ala Gly 
-5 -11 5 

Asn Ala Ser Ala Ala Ser Thr Met Ser Met Ala Pro Thr Ser Thr Met 
10 15 20 

Gly Ala Leu Pro Ala Pro Glu Gly Val Pro Asp Ala Gly Pro Leu Ser 
25 30 35 

ile Thr Pro Glu val lie Arg Gin Gin Gin Ala Asp Ala Val Arg val 
40 45 50 55 

Met Asp Glu Glu Gly Leu Lys Pro Gin Ile Leu Ser Gly Asp Ile Lys 
60 65 70 

Arg Phe Thr Leu Thr Ala Ser Gin Val Asn Trp Tyr Leu Tyr Pro Gly 
75 80 85 

Lys Ala val Val Ala Cys Gly Tyr Asn Gly Gin val Pro Gly Pro Val 
90 95 100 

Page 30 



10406. ST2 5 

Leu Arg val Arg Val Gly Asp Arg val Gin lie Leu Leu Arg Asn Glu 
105 110 ^ 115 

Leu Asn Glu Pro Thr Thr Leu His lie Gin Gly Leu Asp Leu Pro Ala 
120 125 130 135 

ser Gin Leu Gly lie Gly Asp val Thr Glu Ser pro lie Pro Pro Gly 
140 145 150 

Gly Glu Arg Leu Tyr Ser Phe Thr Val Thr Pro Gin Met val Gly Thr 
155 160 165 

His Leu Tyr Glu Ser Gly Thr Asp Met Ala Ser Glu lie Asp Pro Arg 
170 175 180 

Thr Ala Arg Gly Ala Ala Arg Arg ser Gly Pro Gly lie Pro Leu Ser 
185 190 195 

Pro Gly Glu Gly Gly Arg Ala Leu Arg Asp Arg Arg val Asp Gly Gly 
200 205 210 215 

Arg lie Asp His Arg Lys Arg val Trp Pro Gly Arg Gin Ala val Ser 
220 225 230 

Arg Arg Ala Arg Thr Asp Gly Ala val Arg Gin Pro Arg Gly Ala Ala 
235 240 245 

His Arg Gin Arg Glu Arg Asp val Leu Pro Arg His Ala Pro Ala Arg 
250 255 260 

Asp Asp val Leu Ala Ala Gly Gly Arg Arg Ala Pro Pro Arg Gin Ala 

265 270 ~ 275 

Ala Ala Asp Glu Arg Ala Arg His Arg Ala Arg 
280 285 " " 290 

<210> 29 

<211> 626 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CD.. (32) 

<220> 

<221> PROPEP 
<222> (33).. (189) 

<220> 

<221> mat oeptide 

<222> (190).. (626) 

<223> serine-carboxyl protease 
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<400> 29 

Met Gly Leu Trp Lys Arg Leu Ala Leu Gly val Pro Ala Ala Leu 
-185 -180 -175 

ser Met Leu Ala val Gly val Pro val Met ser Ala Asp Thr Val 
-170 -165 -160 

Glu Ala Ala Pro Leu Ala Asn Pro ser Thr Glu Asn Ala Gin Asp 
-155 -150 -145 

Met Gly pro Ala Ser Gly Ser Gin Thr val Thr Ala ser lie lie 
-140 -135 -130 

Leu Arg Val Gin Asn Pro Thr Ala Leu Gin Asn Tyr lie Gin Glu 
-125 -120 -115 

Thr Glu Thr Pro Gly Ser Pro Leu Tyr His Lys Phe Leu Thr Thr 
-110 -105 -100 

Ala Gin Phe Ala Gin Gin Tyr Ala Pro Ser Ala Ala Thr Leu Gin Gin 

-95 -90 -85 

He Glu Gin Glu Leu Gin Gly Tyr Gly Leu Gin val val Asn val Asp 
-80 -75 -70 

Ala Asp His Leu Asp Met Gin val Gin Gly Thr Val Gin Gin Phe Asp 
-65 -60 -55 

Asn Ala Phe Asn Thr val lie Asp Leu Phe Lys Ala Asn Gly His lie 
-50 -45 -40 

Phe Arg Ala Pro Lys Lys Pro Pro Gin lie Pro Val Ala Leu Leu Thr 

-35 -30 -25 -20 

Asn Val Leu Ala val val Gly Leu Asp Thr Ala Gin Ala Ala Gin Ser 
-15 -10 -5 

Leu Thr val Lys Thr Pro Asn Val Ala Gly val Pro Ser Pro Lys Val 
-11 5 10 

val Leu Pro Gin Gly Gly Ser Thr Ala Thr Gly Thr Pro Gly Ser Tyr 
15 20 25 

Thr Val Gly Asp Thr Ala Asn Arg Tyr Asp lie Asn Pro Leu Tyr Gin 
30 35 40 45 

Lys Gly lie Thr Gly Lys Gly Glu Thr lie Gly lie val Thr Leu Ser 
50 55 60 

Ser Phe Asn Pro Gin Asp Ala Tyr Thr Tyr Trp Gin Gly lie Gly Leu 
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65 70 75 

Lys val Ala Pro Asn Arg He Gin Met Val Asn val Asp Gly Gly Gly 
80 85 90 

Gin Met Asp Asp Gly Ser val Glu Thr Thr Leu Asp val Glu Gin Ser 
95 100 105 

Gly Gly Leu Ala Pro Asp Ala Asn Val Val Val Tyr Asp Ala Pro Asn 
110 115 ; 120 125 

Thr Asp Gin Gly Phe lie Asp Ala Phe Tyr Gin Ala val Ser Asp Asn 
130 135 140 

Gin Ala Asp ser Leu Ser val Ser Trp Gly Gin Pro Glu lie Asp Tyr 
145 150 155 

Leu Pro Gin Met Asn Gin Gly Gin ser Tyr val Asp Glu Leu Leu Ala 
160 165 170 

Phe Thr Gin Ala Phe Met Glu Ala Ala Ala Gin Gly lie Ser Met Tyr 
175 180 185 

Ala Ala Ala Gly Asp Ser Gly Ala Tyr Asp Thr Ala Arg Asp Phe Pro 
190 195 200 205 

pro ser Asp Gly Phe Thr Thr Pro Leu Ser Val Asp Phe Pro Ala Ser 
210 215 220 

Asp Pro Tyr lie Thr Ala Ala Gly Gly Thr Thr val Pro Phe Thr Ala 
225 230 235 

Lys Phe Ser Leu Gly Thr val Asn lie Thr Gin Glu Gin Pro Trp ser 
240 245 250 

Trp Gin Tyr Leu Gin Asn Leu Gly Tyr Gin Gly Leu Phe Ser Val Gly 
255 260 265 

Thr Gly Gly Gly Val Ser val lie Phe Pro Arg Pro Trp Tyr Gin Leu 
270 275 280 285 

Gly Val Gly Gly Met Gin Asn ser Ala Ala Asn Gin Ala Phe Thr Asp 
290 295 300 

Ser Gin Gly val Leu Tyr Gly ser Pro Phe Thr Tyr Asn Leu Pro ser 
305 310 315 

Asn Tyr Ala Gly Arg Asn Leu pro Asp lie Ser Met Asp Ala Asp Pro 
320 325 330 

Glu Thr Gly Tyr Leu val Tyr Trp Ser Ala Gly Gly Gly Trp lie Ala 
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335 340 345 

Gly Tyr Gly Gly Thr ser Phe val Ala pro Gin Leu Asn Gly He Thr 
350 355 360 365 

Ala Leu lie Asp Gin Glu Val His Gly Arg val Gly Phe Leu Asn Pro 
370 375 380 

Leu Leu Tyr Thr Leu Leu Thr Gin Gly val Gin Gly Gly Ala Gin Pro 
385 390 395 

Phe His Asp lie Thr Thr Gly Asn Asn Trp Tyr Trp Asn Ala Val Pro 
400 405 410 

Gly Tyr Asp Pro Ala Ser Gly Val Gly Thr Pro Asp val Ala Asn Leu 
415 420 425 

Ala Gin Asp lie Ala Ser Leu Arg 
430 43 5 

<210> 30 

<211> 533 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CD - .(24) 

<220> 

<221> mat_peptide 

<222> (25).. (534) 

<223> seri ne-carboxyl protease 

<400> 30 

Met Arg Ala Leu Ala His Leu Ala lie Gly Ala lie Ala ser Gly val 
-20 -15 -10 

Phe Ala Ala Pro Val Ala Phe Ala Ser Pro val Gin Glu Arg val val 

-5 -1 1 5 

val Ala ser Pro Asp Pro Arg Thr Arg ser val His Ala Asp Gly Glu 
10 15 20 

lie Ser Pro Ser Gin Pro Met His Leu val lie Thr Leu Arg Leu Arg 
25 30 35 40 

His Glu Ala Gin Leu Glu Gin Leu lie Arg Asp Leu Tyr Thr Pro Gly 
45 50 55 

Ser Pro Asp Ala Gly His Phe Leu Thr Pro Ala Ala Phe Asn Ala Ala 
60 65 70 
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Tyr Ala Pro Thr Ala Glu Asp val Gin Ala Val val Gin Gly Leu Arg 
75 80 85 

Ala Tyr Gly Leu Arg val Glu Pro Thr Val Asn Pro Met Val Leu Thr 
90 95 100 

val Ser Gly Arg Ala Arg Asp val Glu Arg Ala Phe Gly Val His Glu 
105 110 115 120 

Leu Gin Phe Gly Arg Gly Ala Gly Ala Trp Tyr Ala Pro Asp Gly Ala 
125 130 135 

Ala Thr Leu Pro Ala Pro Leu Ala Ala Arg val Ser Ala val val Gly 
140 145 - 150 

Leu Thr Ser Asp Ala Met Glu Arg His Leu Val Leu Ala His Val Ala 
155 160 165 

Pro Ala Gly Gly Gly Tyr Thr Pro Ala Gin lie Gin Arg Ala Tyr Asp 
170 175 180 

Tyr Thr Pro Leu Tyr Ser Gin Tyr Met Gly Arg Gly Gin Val He Ala 
185 190 195 200 

val val Thr ser Gly ser val Leu Arg ser Asp Leu Leu Ala Phe Asp 
205 210 215 

Arg Ala Phe Gly Leu Pro Asn Pro val val Arg Gin Arg val lie Asp 
220 225 230 

Gly Ser Ser Thr ser Pro Asp Asp Glu Thr Thr Leu Asp cys Glu Trp 
235 240 245 

Ala His Ala lie Ala Pro Thr Ala Ser Leu Ala Val Tyr Glu Ala Ala 
250 255 260 

Gin Pro Asp Ala Gin Ser Phe lie Asp Ala Phe Ala Gin val Ala Ala 
265 270 275 280 

Asp Asp Gly Ala His val Val Thr Thr Ser Trp Gly Ala Pro Glu Ser 
285 290 295 

Glu Thr Asp Ala Ala Thr Met Gin Ala Glu His Gin lie Phe Met Gin 
300 305 310 

Met Ala Ala Gin Gly Gin ser val Phe Ala Ala Ala Gly Asp Ser Gly 
315 320 325 

Ser Ser Asp Gly Thr Ser Gly Thr Asp val Asp Tyr Pro Ser Ser Asp 
330 335 340 
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Pro Tyr val Thr Ala Cys Gly Gly Thr Arg Leu Val Leu Gly Ala Gly 
345 350 355 360 

Ala Lys Arg Leu Gin Glu Thr Ala Trp Ala Asp Thr Gly Gly Gly Ala 
365 370 375 

Ser Ser val Tyr Gly Glu Pro Trp Trp Gin Tyr Gly Pro Gly val Pro 
380 385 390 

Gin Thr Gly Tyr Arg Gin Thr Cys Asp Val Ala Leu Asn Ala Asp Pro 
395 400 405 

Ala Thr Gly Tyr Asp Phe He Tyr Glu Gly Gin Trp Glu val Ala Gly 
410 415 420 

Gly Thr Ser Phe val Ala Pro Met Met Ala Ala Thr Phe Ala Leu lie 
425 430 435 440 

Asp Gin Ala Arg Ala Leu Glu Gly Lys Pro Pro Val Gly Leu Ala Asp 
445 450 455 

val Gly lie Tyr Ala Met Ala Arg Asn Ala Ser Tyr Ala Pro Tyr Ala 
460 465 470 

Phe His Asp lie Thr Ala Gly Ser Asn Gly Ala Tyr Ser Ala Gly Pro 
475 480 485 

Gly Trp Asp His Pro Thr Gly Phe Gly Ser lie Asp Ala Tyr Tyr Phe 
490 495 500 

Leu His Gly Leu Asp 
505 

<210> 31 

<211> 360 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> (1)..(41) 

<220> 

<221> mat_peptide 
<222> (42).. (411) 

<223> protease or a HtrA-like serine protease 
<400> 31 

Met Arg Arg Arg Arg Trp Asp Tyr Glu Asp Trp Pro ser Glu Asn Arg 
-40 -35 -30 

Arg val Gly Val Trp Leu Ala Ser Gly Thr Ala Leu Leu Ala lie Cys 
-25 -20 -15 -10 
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Tyr He Leu Gly lie Trp Thr Gly Ala Ala Leu Thr Arg Gly His Ser 
-5 -11 5 

Gin Thr Thr val Glu Tyr Val Pro Pro Gin Thr Gly Asn Thr Ala ser 
10 15 20 

Thr ser Gly Ser Leu Thr Pro lie Pro Gly val Glu Asp Thr Thr lie 
25 30 35 

val Thr Gin lie Tyr Asn Arg Val Lys Asn Ser lie Phe Thr lie Thr 
40 45 50 55 

Ala val Ser Gly Gly Lys Pro Thr Ser Ser Asp Ala Glu Glu Asp He 
60 65 70 

Gly Thr Gly Phe Leu lie Asp His Asn Gly Asp Leu Leu Thr Asn Ala 
75 80 85 

His val val Gly ser Ala Thr Thr val Gin val ser Gly Asp Asn Arg 
90 95 100 

Gin Phe Val Gly Arg val He Asp Ala Asp Gin Leu Asp Asp Leu Ala 
105 110 115 

lie Val Arg lie Pro Ala Pro Lys Ser Leu Glu Pro Leu Pro Leu Gly 
120 ' 125 130 135 

Ser val Lys Ser Leu Gin Pro Gly Ser Leu Val lie Ala lie Gly Asn 
140 145 150 

Pro Phe Glu Leu Thr Ser Ser val Ser Ser Gly lie Val Ser Gly Leu 
155 160 " 165 

Asn Arg Ser Met Ser Glu Ser Asn Gly His Val Met Asn Gly Met lie 
170 17 5 180 

Gin Thr Asp Ala Pro Leu Asn Pro Gly Asn Ser Gly Gly Pro Leu Leu 
185 190 195 

Asn Ala Ala Gly Gin val val Gly lie Asn Thr Leu lie Glu Ser Pro 
200 205 210 215 

lie Glu Gly ser lie Gly lie Gly Phe Ala lie Pro lie Asp Arg Phe 
220 225 230 

lie Gin Leu Glu Pro Glu Leu Leu Ala Gly Lys Pro Val Ala His Ala 
235 240 245 

Trp Leu Gly lie Glu Gly Met Asp lie Asp Asn Leu Met Arg Gin Ala 
250 255 260 
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Leu His Leu Pro val Ala ser Gly val Tyr val Thr Glu val Thr Pro 
265 270 275 

Gly Gly Pro Ala Ala Lys Ala Gly Leu Arg Gly Asp Ser Asn Ala Ala 
280 285 290 295 

Lys Leu Asn Ser Leu Ser Gin Ser Ala Asn Pro Tyr Ala Leu Leu Lys 
300 305 310 

Gly Asn Gly Asp lie lie val Gly 
315 

<210> 32 
<211> 211 
<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> (1)..(30) 

<220> 

<221> mat_pepti de 

<222> (31).. (212) 

<223> disulfide isomerase 

<400> 32 

Met Arg Arg Ser Trp Ser val Leu Met Ala val Cys Met Ser Trp Leu 
-30 -25 -20 -15 

Ala Val Gly Cys Gly Thr Pro Ala Asn ser Leu Ser Gin Ala Thr Ala 
-10 -5 -1 1 

Ala ser Gly Arg His Ala Pro His Pro Leu val Phe Gin Asn Leu Thr 
5 10 15 

Gly Ala Met Asn Glu Gly Gin Asp Pro Arg Trp Asp Pro Lys Ala Ala 
20 25 30 

Pro Thr Gly Val Tyr Asp Asp Val Thr val val Thr Ala Ser Gly Arg 
35 40 45 50 

Gin Glu Val Leu Ser val Arg Asp Ala Pro Leu Leu Phe Ala Ala Tyr 
55 60 65 

Trp Cys Pro His Cys Gin Arg Thr Leu Gin Leu Leu Thr Ser lie Glu 
70 ~ 75 80 

Ser Arg Leu Lys Gin Lys Pro lie Leu val Asn val Gly Tyr pro Pro 
85 90 95 

Gly Thr Thr Leu Gin Thr Ala Ala Arg lie Ala Arq Glu Glu ser Gin 
100 105 110 
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Val Leu His Leu Ala Pro Phe Gin Glu val Phe lie Leu Asn Pro Asp 
115 120 125 130 

Ala Gly Asp Arg Tyr Ala Pro Leu Gly Tyr Pro Thr Leu Ala Phe Tyr 
135 140 145 

Arg Ala Gly Arg Asp Trp Thr Leu Tyr Gly Glu His Arg Ala Ser lie 
150 155 160 

Trp Glu Lys Ala Leu ser Glu ser Thr ser Lys Ala Tyr Asn Gly ser 
165 170 175 

Glu Glu Ser 
180 

<210> 33 

<211> 266 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> (1)..(29) 

<220> 

<221> mat_peptide 
<222> (30). .(266) 

<223> gamma-D-glutamyl-L-diamino acid endopepti dase 
<400> 33 

Met Asp Glu Met Asn lie Arg ser Trp Cys val Ala Ala Cys Thr val 
-25 -20 -15 

Ala Leu Thr ser Ala val Gly Ala Thr Thr Ala Phe Ala Gin Thr val 
-10 5 -11 

Thr Val Gin Pro Gly Gin Ser Leu Trp Thr lie Ala Arg Ala His Gly 
5 10 15 

Met Pro Val Gin Leu val Ala ser Ala Asn Pro Gin Tyr Asn Pro Leu 
20 25 30 35 

Asn Leu Pro val Gly Ala Thr Val Thr Leu Pro Ser Leu Lys Asp val 
40 45 50 

Ala Val Gin Pro Gly Asp Ser Leu Phe Leu He Gly Arg Gin Tyr Gly 
55 60 65 

val Ser Leu Ala Glu Met Leu Ala Ala Asn Pro Asn val Asp Pro Leu 
70 75 80 

Asn Leu Gin val Gly Ser Ser val Arg Val Pro Leu Ala Ser Ser Ser 
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85 90 95 

Thr Lys ser ser Thr val Ser Ala His Val Ala Ala ser Thr Pro Glu 
100 105 110 115 

Asn Ser Asn Asn Leu Tyr Trp Leu Glu Arg val lie His Ala Glu Ala 
120 125 130 

Gly Gly Glu Ser Leu Gin Ala Gin lie Ala val Ala Asp val lie Leu 
135 140 145 

His Arg Met Ala Ala Gly Gly Tyr Gly Ser Thr val Gin Gin val val 
150 155 160 

Phe Gin Val Ser Asp Gly His Tyr Gin Phe Glu Ser val Ala Asn Gly 
165 170 175 

Ser lie Tyr Gly Gin Pro Asp Ala Gin Asn val Gin Ala Ala Leu Asp 
180 185 190 195 

Ala Leu Asn Gly Asp Asp val val Pro Gly Ala Leu Val Phe Tyr Asn 
200 205 210 

Pro Ala Gin Thr pro ser Gly Ser Trp val Trp Gin Gin Pro Val val 
215 220 225 

Ala His lie Gly His Leu val Phe Ala Lys 
230 235 

<210> 34 

<211> 768 

<212> PRT 

<213> Alicyclobaci 11 us sp. 
<220> 

<221> SIGNAL 

<222> CD-.C26) 

<220> 

<221> mat_peptide 
<222> (27) . . C768) 

<223> endo-beta-N-acetylglucosaminidase 
<400> 34 

Met Lys Thr His Arg Leu Leu Ala Val Ala Ala Leu Pro Ala Thr val 

-25 ~ -20 -15 

Leu Leu Thr Thr Pro Ala Pro Ala Leu Ala Glu Thr Ser Ser Ser Gin 
-10 -5 -11 5 

Ser Ala Ser Ala Pro Ser Leu Asn Val Pro val Ala Ala Leu Thr Leu 
10 15 20 
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Ala Gly val Gin Ser Tyr Pro Met Leu ser Tyr Gly Ser Thr Gly Val 
25 30 35 

Tyr Val Glu lie Leu Gin Asn Ala Leu Asn Ala Leu Gly Tyr Asp Val 
40 45 50 

Gly Gin Ala ser Gly Leu Phe Asp Ala Thr Thr Gin Ala Glu val Lys 
55 60 65 70 

Ala Phe Gin Gin Ala Met Gly Leu Gin Thr Asp Gly lie val Gly Pro 
75 80 85 

Leu Thr Trp Gly Ala Leu Ala Lys Ala Val Ala Asp Tyr Arg Gin val 
90 95 100 

Met Thr Val Leu Ser Ser Arg Ser Ser Leu Val Gin Gin Val Glu Trp 
105 110 115 

Lys Arg lie val Trp Asn Gly Arg Leu lie Ser Lys Pro lie Gly Phe 
120 125 130 

Thr Tyr Gin Gly Thr Ala Tyr Met Pro lie Trp Tyr Val Met Gin Ala 
135 140 145 150 

Leu Ser Lys Ala Gly lie Ala Ser Thr Trp Gin Gly Gly val Trp Thr 
155 160 165 

Leu Thr Pro Pro Gly Gly Gin Thr Val Asn Tyr Gly Lys He Ser Tyr 
170 175 180 

Gly Pro Gly Ser Ala Ala lie Ala lie Gly Gin Thr val val Ala Asn 
185 190 195 

val Pro Ala val Val Tyr Pro Asp Pro Ala Ser Gly Lys Leu Thr Thr 
200 205 210 

Phe Met Pro val Trp Tyr Val Met Asn Ala Leu Gin Arg Leu Gly He 
215 220 225 230 

Gly Ser Thr Trp Gin Gly Thr Glu Trp Asp Met Lys Pro Ala Pro val 
235 240 245 

val lie Glu Thr Gly Asp Pro Ser Asn Asn Thr Thr Gly Ser Asp Pro 
250 255 260 

Ala Asn ser Thr Gly Asn Gly Thr Gly Asn ser Thr Gly Asn Ala Thr 
265 270 275 

Gly Ala val Pro Gly Gly Asn Thr Val Thr Asn Val Thr Thr Gly Ser 
280 285 290 
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Ser Asn Val Thr Gly Asn Ser Thr Gly Asn Ser Leu Gly Asn Ser Thr 
295 300 305 310 

Gly Asn Ser Leu Gly Asn Ser Thr Ser Asn Ala Thr Gly Asn Ala Thr 
315 320 325 

Gly Asn Thr Thr Gly Asn Ala Thr Gly Asn ser Thr Gly Thr Ser Ser 
330 335 340 

Gly Ser Phe Thr Asn Val Asp Leu Arg Tyr Pro Ala Pro ser Asn lie 
345 350 355 

Asn Ala Gin Ser lie Asn Gin Phe Leu Leu Gin Asn Ser Ser Pro Leu 
360 365 370 

Asn Gly Leu Gly Asn Ser Phe Met Asp Ala Gin Asn Leu Tyr ser Val 
375 380 385 390 

Asp Ala Asn Tyr Leu Val Ser His Ala lie Leu Glu Ser Ala Trp Gly 
395 400 405 

Gin Ser Gin lie Ala Leu Gin Lys Asn Asn Leu Phe Gly Tyr Gly Ala 
410 415 420 

Tyr Asp Ser Asn Pro Gly Gin Asp Ala Gly val Phe Pro Ser Asp Asp 
425 430 435 

Tyr Ala lie Arg Phe Glu Ala Trp Thr val Arg Met Asn Tyr Leu Thr 
440 445 450 

Pro Gly Ala Ser Leu Tyr val Thr Pro Thr Leu Ser Gly Met Asn val 
4 55 460 465 470 

Asn Tyr Ala Thr Ala Lys Thr Trp Ala Ser Gly lie Ala Ala He Met 
475 480 485 

Thr Gin Phe Ala Ser ser Val Gly Ser Asn val Asn Ala Tyr Val Gin 
490 495 500 

Tyr Thr Pro ser Asn Asn Pro Pro Ala Pro Arg Ser Thr Ala Glu Pro 
505 510 515 

val Tyr Tyr Met Asn Gly Ala Gin Gly Val Thr Gin Gin Asp Pro Tyr 
520 525 530 

Tyr Pro Asn Gly Gly Val Pro Tyr Tyr Pro Thr lie Ala Gin Gly Glu 
535 540 545 550 

Asn Gin Gin Phe Phe Gly Gin Leu Ser val Gly Ser Phe Gly Gin Pro 
555 560 565 
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val val Glu val Gin Gin phe Leu Asn Arg Thr lie Asn Ala Gly Leu 
570 575 " 580 

Thr val Asp Gly Gin Phe Gly Pro Leu Thr Gin Ala Ala Val Glu Lys 
585 590 595 

Phe Gin Ser Gin val Met His Met Ser Asn Pro Asn Gly lie Trp Thr 
600 605 610 

Phe Ser Met Trp Val Gin Tyr lie Gin Pro ser Gin Ser Asn Ala Asn 
615 620 625 630 

Leu lie Pro Ala Gly Thr Thr Val Lys lie Asp Gin Val Ala Glu Gly 
635 640 645 

Met Ala Gly Pro Tyr Val Val Pro Trp Tyr His val val Gly Tyr Gly 
650 655 660 

Trp val Asp Ser Gin Tyr He Lys Leu Thr Asn Val Tyr Arg Val lie 
665 670 675 

val Gin Asn Pro Ala Gly Thr Ala Thr Thr lie Pro val Tyr Gin val 
680 685 690 

Gly Asn Leu Ser Ser val Leu Leu Asn Leu His Ser Gly Asp Trp val 
695 700 705 ' 710 

val Ala Asn ser Ala Gin Pro ser Gly Gly val Tyr Thr lie Gin lie 
715 720 725 

Ala Ala Gin Asp Pro Pro Cys Arg Thr Ala Thr Pro Pro Gly Arg Ser 
730 735 740 

<210> 35 
<211> 597 
<212> PRT 

<213> Alicyclobacillus sp. 

<220> 

<221> SIGNAL 
<222> CD. .(49) 

<220> 

<221> mat_peptide 

<222> (50).. (597) 

<223> multi copper oxidase 

<220> 

<221> MISC^FEATURE 

<222> (139). .(139) 

<223> putative copper binding site 

<220> 

<221> MISC_FEATURE 
<222> (141). .(141) 
<223> putative copper binding site 
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<220> 

<221> MI SC— FEATURE 
<222> (181).. (181) 
<223> putative copper binding site 

<220> 

<221> MI 5C__ F E ATU R E 

<222> (183).. (183) 

<223> putative copper binding site 

<220> 

<221> MISC_FEATURE 

<222> (514).. (514) 

<223> putative copper binding site 

<220> 

<221> MISC_FEATURE 

<222> (566).. (566) 

<223> putative copper binding site 

<400> 35 

Met Met Ala His Asp Arg Leu Asp Arg Arg val Asn Glu Arg Arg Gin 
-45 -40 -35 

Ala Met Arg Arg Ala Ala Lys Trp Ala lie Ala Leu Gly Thr Thr Ala 
-30 -25 -20 

Val val Ala Gly Val Ser Ser val Phe Ala Leu Arg Ser Val Arg Glu 
-15 -10 -5 

Ala Asn Leu Asn Pro Asn Ala Pro Leu Ala Asn Val Pro Gly Pro Gin 
-11 5 10 15 

Gly Ala Tyr Thr pro lie ser Ala Leu Gin Pro Val Val Pro Lys Asn 
20 25 30 

Ala Arg lie Asp His Tyr Thr Leu Thr Ala Glu ser Arg Thr Leu Thr 
35 40 45 

Val Gly Gly His Ala Leu Gin Ala Met Thr Phe Asn Gly Thr Ala Pro 
50 55 60 

Gly Pro Leu Leu val Ala His Gin Gly Asp Val val Lys val Thr val 
65 70 75 

His Asn Arg Leu Ser val Pro Leu Thr lie His Trp His Gly lie Ala 
80 85 90 95 

val Pro Gly Ala Glu Asp Gly Val Pro Gly val Thr Gin Asn pro lie 
100 105 110 

Pro Pro Gly Gly ser Tyr Thr Tyr Glu Phe Gin val Asn Gin Pro Gly 
115 120 125 

Thr Tyr Trp Tyr His Ser His Glu Ala Ser Phe Glu Glu Val Gly Leu 
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130 135 140 

Gly Leu Tyr Gly Ala Phe Val val Leu Pro Lys Arg Ala val His Pro 
145 150 155 

Ala Asp Arg Asp Tyr Thr Leu val Leu His Glu Trp Pro Thr Ala Ser 
160 ~ 165 170 175 

Thr Ala Gin Thr Met Met Ala Asn Leu Lys Ala Gly Asn Leu Gly Phe 
180 185 190 

Ser Ala Lys Gly Glu ser Ala Gly Met Gly Gly Met Gly Met Gin Gin 
195 200 205 

Asn Gly Asp Met Asn Gly Met Gly Met Met Gly Ala Ala Asp Gly Thr 
210 215 220 

Gly Gin Gly Gly Asn Ser Ala Ser Asp lie Ala His Val Leu Pro Gly 
225 230 235 

Pro Pro Leu Gin Leu Asn Gly Phe Ser Pro Thr Ala Asn Asp Trp Ala 
240 245 250 255 

Ala Leu Asp Glu Met Ala Gly Met Tyr Asp Ala Phe Thr Val Asn Gin 
260 265 270 

Asn Ala Ser Gly Thr Thr Leu Leu Pro Ala Lys Pro Gly Gin Leu val 
275 280 285 

Arg Leu Arg He val Asn Ser Gly Asn Met Thr His Leu Phe Thr Leu 
290 295 300 

val Gly Ala Pro Phe Arg Val Val Ala Leu Asp Gly His Asp He Ala 
305 310 315 

Asn Pro Gly Trp lie Arg Gly Val Leu Leu Pro Val Gly Ala Ala Glu 
320 325 330 335 

Arg Tyr Asp lie Glu Phe Arg Val Pro Lys Ser Gly Ala Ala Phe Leu 
340 345 350 

val cys Ala Asp Pro Asp Thr Thr Ala Gin Arg Glu Leu Arg Ala Ala 
355 360 365 

He Gly Leu Pro Asp Ala Trp Ser Gin Phe Lys Glu Thr Asp Ala Ala 
370 375 380 

ser Leu Glu Arg Ala Pro Trp Phe Asp Phe Thr His Tyr Gly Ser Gly 
385 390 395 

Arg Leu Pro Gly Glu Ala val Phe Arg Leu His Gin Ala Tyr Gin Val 
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400 405 410 415 

Arg Tyr Asn Met Lys Leu Thr val Gly Met ser Met Asn Gly Met val 
420 425 430 

Tyr Ala lie Asn Gly Lys val Phe Pro Asn lie Pro Pro lie Val val 
435 440 445 

Arg Lys Gly Asp Ala Val Leu Val His He val Asn Asp Ser Pro Tyr 
450 455 460 

lie His Pro Met His Leu His Gly His Asp Phe Gin val Leu Thr Arg 
465 470 475 

Asp Gly Lys Pro val Ser Gly Ser Pro lie Phe Leu Asp Thr Leu Asp 
480 485 490 495 

Val Phe Pro Gly Glu Ser Tyr Asp lie Ala Phe Arg Ala Asp Asn Pro 
500 505 510 

Gly Leu Trp Met Phe His Cys His Asp Leu Glu His Ala Ala Ala Gly 
515 520 525 

Met Asp val Met val Gin Tyr Ala Gly lie Arg Asp Pro Tyr Pro Met 
530 53 5 540 

Ser Glu Met Ser Glu 
545 

<210> 36 

<211> 245 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CI).. (29) 

<220> 

<221> mat_peptide 

<222> (30).. (246) 

<223> pepti dyl -prol yl -i somerase 

<400> 36 

Met Lys Arg Arg Thr Leu Leu Ala Gly He Thr Leu Ala Ala Leu Val 
-25 -20 -15 

Ala Val Ala Gly Cys Gly Thr Pro Ala Gly Asn Thr Ala ser Pro Asp 
-10 -5 -1 1 

Asn Thr Ala Asn Leu Ser Asn Thr Asn Ala Pro Asp Thr Leu Ser Asn 
5 10 15 
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Glu Thr Gly Gin Thr Leu Asp Thr Ala Asn Pro Pro Tyr Leu His Thr 
20 25 30 35 

Ser Thr Glu Gin Trp Lys Ser Met Pro Lys Met Phe lie Asn Pro Asn 
40 45 50 

Lys Thr Tyr Asp Ala lie Val His Thr Asn Tyr Gly Thr Phe Thr lie 
55 60 65 

Gin Leu Phe Ala Lys Asp Ala Pro lie Thr val Asn Asn Phe Val Phe 
70 75 80 

Leu Ala Glu His Asn Phe Tyr His Asp Cys Thr Phe Phe Arg lie val 
85 90 95 

Lys Asn Phe val lie Gin Thr Gly Asp Pro Arg Asn Asp Gly Thr Gly 
100 105 110 115 

Gly Pro Gly Tyr Thr lie Pro Asp Glu Leu Ser His Gin val Pro Phe 
120 125 130 

Thr Lys Gly lie val Ala Met Ala Asn Thr Gly Gin Pro His Thr Gly 
135 140 145 

Gly Ser Gin Phe Phe lie Cys Thr Ala Asn Asp Thr Gin val Phe Gin 
150 15 5 160 

Pro Pro Asn Asn Arg Tyr Thr Glu Phe Gly Arg Val lie Ser Gly Met 
165 170 175 

Asp val lie Asp Lys lie Ala Ala lie Pro val Thr Glu Asn pro Met 
180 185 190 195 

Thr Gin Glu Asp Ser Tyr Pro Leu Lys Thr Ala Tyr lie Glu ser lie 
200 205 210 

Gin He Gin Glu Ser 
215 

<210> 37 
<211> 608 
<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> (1)..(27) 

<220> 

<221> mat_peptide 
<222> C28). .(608) 

<223> acid phosphatase or a phytase or a phospholipase C 
<400> 37 
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Met Lys Lys Gly Lys Arg Trp Ser Ala Ala Leu Ala Thr Ser val Ala 
-25 ~ -20 -15 

Leu Phe Ala Thr Leu Ser Pro Gin Ala Leu Ala Ser Asp Thr Val Val 
-10 -5 -1 1 5 

Pro Gin Val Asn Thr Leu Thr Pro lie His His Leu Val Val lie Phe 
10 15 20 

Asp Glu Asn val Ser Phe Asp His Tyr Phe Ala Thr Tyr pro Asn Ala 
25 30 35 

Ala Asn Pro Ala Gly Glu Pro Pro Phe Tyr Ala Ala Pro Gly Thr Pro 
40 * 45 50 

Ser val Asn Gly Leu Ser Gly ser Leu Leu Thr His Asn Pro Asn Gly 
55 60 65 

val Asn Pro Gin Arg Leu Asp Arg ser Gin Ala val Thr Pro Asp Met 
70 75 80 85 

Asn His Asn Tyr Thr Pro Glu Gin Gin Ala Val Asp Gly Gly Arg Met 
90 95 100 

Asp Asn Phe lie Asn Thr val Gly Arg Gly Asn Pro lie Asp Leu Asp 
105 110 115 

Tyr Tyr Asp Gly Asn Thr val Thr Ala Leu Trp Tyr Tyr Ala Gin His 
120 125 130 

Phe Ala Leu Asn Asp Asn Ala Tyr cys Thr Gin Tyr Gly Pro Ser Thr 
135 140 145 

Pro Gly Ala lie Asn Leu lie ser Gly Asp Thr Ala Gly Ala Thr val 
150 155 160 165 

Tyr ser Ser Ser Glu Thr Ser Gly Ala Ala Gin Val val Pro Pro Gly 
170 175 180 

Ser Lys Asn Phe Pro Asn Ala Val Thr Pro Asn Gly val Asp lie Gly 
185 190 195 

Asp lie Asp Pro Tyr Tyr Asp Ser Ala Ser Lys Gly Met Thr Met Ala 
200 205 210 

Met Ala Gly Lys Asn lie Gly Asp Leu Leu Asn Ala Lys Gly Val Thr 
215 220 225 

Trp Gly Trp phe Gin Gly Gly Phe Ala Asn Pro Asn Ala Lys Asp Asn 
230 235 240 245 
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Asn He Ala Gly Thr Asp Glu Thr Thr Asp Tyr Ser Ala His His Glu 
250 255 260 

Pro Phe Gin Tyr Tyr Ala Ser Thr Ala Asn pro Asn His Leu Pro Pro 
265 270 275 

Thr Ser val Ala Met lie Gly Arg Thr Asp Gin Ala Asn His Gin Tyr 
280 285 290 

Asp lie Thr Asn Phe Phe Gin Ala Leu Gin Asn Gly Asn Met Pro Ala 
295 300 305 

val Ser Phe Leu Lys Ala Pro Glu Tyr Glu Asp Gly His Ala Gly Tyr 
310 315 320 325 

ser Asp Pro Leu Asp Glu Gin Arg Trp Leu val Gin Thr lie Asn Gin 
330 335 340 

lie Glu Ala Ser Pro Asp Trp Ser Ser Thr Ala lie lie lie Thr Tyr 
345 350 355 

Asp Asp Ser Asp Gly Trp Tyr Asp His val Met Pro Pro Leu val Asn 
360 365 370 

Gly Ser Ser Asp Lys Ala Val Asp Val Leu Gly Gly Thr Pro val Leu 
375 380 385 

Gin Asn Gly Thr Asp Arg Ala Gly Tyr Gly Pro Arg val Pro Phe Leu 
390 395 400 405 

val lie Ser Pro Tyr Ala Lys His Asn Phe val Asp Asn Thr Leu lie 
410 415 420 

Asp Gin Thr ser val Leu Arg Phe lie Glu Glu Asn Trp Gly Leu Gly 
425 " 430 435 

Ser Leu Gly Pro Ala Ser Tyr Asp ser Leu Ala Gly Ser lie Met Asn 
440 445 450 

Met Phe Asp Trp Asn Thr Gin Asn Pro Pro Val Phe Leu Asp Pro Thr 
455 460 465 

Thr Gly Glu Pro val ser Pro Asp Met Gin Pro Glu val lie Arg Gly 
470 475 480 485 

Thr Thr Tyr Leu ser Leu Asn His Tyr Ala Gin Asn Leu Asp Val val 
490 495 500 

Leu Gin Thr ser Arg Gly Met Ala Arg Phe Ser Tyr Glu Gly His Glu 
505 510 515 

Page 49 



10406. ST2 5 

Val Glu lie Asp Glu Arg Ser Gly Leu val Arg val Asp Gly Glu Ala 
520 " 525 530 

val His Leu Lys Ala Pro Leu val Arg val Asp Gly val Trp Met val 
535 540 ~ 545 

Pro val Glu Glu Met Asp Ser Leu lie Gly Ala Thr Leu His Thr Tyr 
550 555 560 565 

Thr Asp Gly His Leu Thr Tyr Tyr Leu Phe Ser Pro Gin Asp Ala His 
570 575 580 

<210> 38 
<211> 250 
<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> (1)..(25) 

<220> 

<221> mat_peptide 

<222> (26) . . (251) 

<223> polysaccharide deacetylase or a xylan deacetylase 

<400> 38 

Met Leu ser Leu Trp Lys Arg lie Arg Thr Gly Thr Leu Ser Leu Leu 

-25 -20 ~ " -15 -10 

Ala Ala Cys Ala cys Ala Leu ser Ala Met Gly Ala Gly Ala Gly Trp 
-5 -11 5 

Val His Ala Ala Glu Ser Gin Ala Gin Ala Pro Arg Ala He Tyr Lys 
10 15 20 

val Asp Thr Lys Glu Lys Val val Ala Leu Thr Phe Asp lie Ser Trp 
25 30 35 

Gly His Arg Thr Pro Glu Pro val Leu Glu Thr Leu Lys Lys cys Gly 
40 45 50 55 

val Thr Lys Ala Thr Phe Phe Leu Ser Gly Pro Trp Thr Met His His 
60 65 70 

Ala Asp lie Ala Lys Lys lie Lys Ala Met Gly Tyr Glu lie Gly ser 
75 80 85 

His Gly Tyr Leu His Lys Asp Tyr Ser Asn Tyr Pro Asp Ser Trp lie 
90 95 * 100 

Arg Glu Gin Ala Met Leu Ala Asp Lys Ala lie Gin Gin Val Thr Gly 
105 110 115 
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Val Lys Pro Lys Leu Phe Arg Thr Pro Asn Gly Asp Leu Asn Pro Arg 
120 125 130 135 

Val lie Arg Cys Leu Thr Ser Met Gly Tyr Thr val val Gin Trp Asn 
140 145 150 

Thr Asp Ser Leu Asp Trp Lys Asn Pro Gly val Asp Ala lie Val Asn 
1S5 160 165 

Arg val Thr Lys Arg val val pro Gly Asp He He Leu Met His Ala 
170 175 180 

Ser Asp Ser Ser Lys Gin He val Glu Ala Leu Pro Arg lie lie Glu 
185 190 195 

ser Leu Arg Gin Gin Gly Tyr Arg Phe val Thr val Ser Glu Leu Leu 
200 205 210 215 

Ala Gly Ala Ser val Gin ser Lys val Gin 
220 225 

<210> 39 
<211> 324 
<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> CI).. (21) 

<220> 

<221> mat_peptide 
<222> (22). .(324) 

<223> polysaccharide deacetylase or a xylan deacetylase 
<400> 39 

Met Arg Lys Thr Ala Ala Gly Ala Cys Ala Leu Ala Leu Met Gly Val 

-20 -15 -10 

Leu Gly Gly Trp Ala Gly Ala Ala Gly Thr Ala val Asn Ala His Ala 
-5 -11 5 10 

Pro Ala Ala Ser Ala Pro Ser val Ser Ala His Val Trp Glu Glu Val 
15 20 25 

ser Arg Thr Trp Gly Thr Leu Pro Val Asp Ala Arg His Asp Gly Val 
30 35 40 

Trp His Asn lie Pro Gly Leu ser Gly Phe Ala Leu Asp Thr Ala Ala 
45 50 55 

Ser Glu Arg Glu Thr Ala Arg Arg His Asp Gly Ala Leu His Leu val 

Page 51 



10406. ST2 5 

60 65 70 75 

Trp Arg Thr Leu Pro Pro Lys Arg Arg Leu Gly Asp Leu ser pro Asp 
80 85 90 

val He Tyr Arg Gly Pro Ala Gin Glu Lys Ser val Ala Leu Met val 
95 100 105 

Asn val Ser Trp Gly Asp Ala Tyr Val Pro Arg Met Leu Glu Val Leu 
110 115 120 

Arg ser Ala His val Lys Ala Thr Phe Phe val Asp Gly Ala Phe Ala 
125 130 135 

Lys Lys Phe Pro Asp Leu Val Arg Ala Met Ala Arg Asp Gly His Ala 
140 145 " 150 155 

val Glu ser His Gly Phe Gly His Pro Asp Phe Arg Arg Leu Ser Asp 
160 165 170 

Ala Lys Leu Ala Ala Gin Leu Asp Glu Thr Asn Arg val Leu Ala Gly 
175 180 185 

lie Thr Gly Lys val Pro Arg Leu lie Ala Pro Pro Ala Gly Ser Tyr 
190 195 200 

Asp Ala Arg Leu Ala Pro Leu Ala His Ser Arg Arg Met Tyr Ala lie 
205 210 215 

Leu Trp Thr Ala Asp Thr val Asp Trp Lys Asn Pro Pro Ala Asp Val 
220 225 230 235 

lie val Gin Arg Val Gin Arg Gly Ala Glu pro Gly Ala Leu lie Leu 
240 "* 245 2 50 

Met His Pro Thr Ala Pro Thr Ala Glu Ala Leu Pro Asp val lie Arg 
255 260 265 

Trp Leu Glu Gly His Gly Tyr Arg Leu Lys Thr val Glu Asp Val lie 
270 275 280 

Asp Glu Arg Pro Ala val Thr pro Pro Thr Thr Leu Ala Asn Glu Thr 
285 290 295 

Phe His Ser Ala 
300 

<210> 40 
<211> 214 
<212> PRT 

<213> Alicyclobacillus sp. 
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<220> 

<221> SIGNAL 
<222> CD.. (29) 

<220> 

<221> mat^peptide 

<222> (30).. (214) 

<223> sulfite oxidase 

<400> 40 

Met Met Arg Trp Asn Trp Lys val Ala val Gly Ser Leu Ala Leu Ala 
-25 -20 -15 

Ala Leu Gly Ala Gly Ala Ala Val Ser Pro Val Phe Ala Ala Ala Lys 
-10 -5 -11 

Ser ser Lys Ala Ala Gin Ser His Ala Glu Ala ser Ala Ala val Val 
5 10 15 

Met Ala Gly Lys Leu Tyr Gly Asn lie Pro Asn val Thr lie Arg Gly 
20 25 30 35 

val Glu Ala Gly Lys Ala Pro Trp val val Asp Gly ser Tyr Gin Leu 
40 45 50 

Lys ser Asn Leu Phe Thr Ala Ser Gly Lys Trp Leu lie lie Pro Lys 
55 60 65 

Gin Gly Tyr Met Glu Asn Gly Gin Pro val Pro Ala Lys lie Gly Gly 
70 75 80 

Thr Thr Asn Asn lie Pro Ala val Gly Ala Glu lie Thr Phe Ala Asn 
85 90 95 

Ala Ala Pro lie Val Leu Pro Pro val Lys Leu Ser Ser Gin Gly Asp 
100 105 110 115 

Phe Ser Phe His Asp Ala He Gin Trp Pro Lys Gly Ala Ala Gin pro 
120 125 130 

val lie Leu lie Gly Pro Glu Lys Asn Gly Gin Leu val Ala Trp Phe 
135 140 145 

Ala Ala ser Asp Phe Leu Ala Asp Tyr Gly Gin Ala Thr Gly Met Gly 
150 155 160 

Gly Gly Trp Val Asn Ala Ala His Pro Glu Thr Pro Val Arg His Thr 
165 170 175 

His Leu Ala Ser Lys Lys 
180 185 
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<210> 41 

<211> 257 

<212> PRT 

<213> Alicycl ©bacillus sp. 
<220> 

<221> SIGNAL 

<222> (1)..(21) 

<220> 

<221> mat_peptide 

<222> (22).. (257) 

<223> functional polypeptide 

<400> 41 

Met Asn Trp Ala Arg Val Gly Ala Trp val Ser Thr Trp Leu val Ala 
-20 -15 -10 

Thr Ala Leu Gly Ala Gly Cys Gly Thr Ala ser Gin Glu His pro ser 
-5 -11 5 10 

Asn Thr ser Thr Ser Asp His Arg val Ala Pro Ala Ala Pro Gly Gly 
15 20 25 

Ser Ala ser Met Gin Asn Arg His lie Leu Gin Glu Pro Leu Pro Arg 
30 35 40 

Gly val Lys Thr Glu Thr Asp Leu Tyr Asn Trp Leu Leu Trp Gin Arg 
45 50 55 

Leu Ala Glu lie Asn Asn Pro Ala Gin Gly Glu lie Cys Leu Asp Ala 
60 65 70 75 

Ala cys Lys lie Ala Ala Thr val Phe Ser Gly Pro Ala Lys Ala Ala 
80 85 90 

Ala Gly Thr Pro val Thr Leu Val Ala Phe Ser Pro Arg Ala Gly Trp 
95 100 105 

Gin val Leu val Gly Pro Leu Pro Gin ser Asp Asn Pro Pro Arg Gin 
110 115 120 

Ala Gin Ser He Thr Gly Gin Ser Ala Arg Leu Pro Ala Gin Arg Gly 
125 130 135 

Arg Met Arg Arg ser Asn Pro Arg Asn Arg Leu val Leu Asp Ser Gly 
140 ~ 145 " 150 155 

Arg Thr Pro Ala Ala Asp Ala Ser Ala Ala Arg Met Thr Arg Gin Leu 
160 165 170 

Arg Arg ser Ala ser Ser Thr Asn Ala Ser Arg ser Arg Arg Ala Lys 
175 180 185 
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Ser Met Ala Arg cys Gin Lys Ser Gly Cys val Arg ser Ala pro Met 
190 195 200 

Cys Phe Trp Ala Arg Ser Ser Thr Arg Met Arg Pro val Ser Arg ser 
205 210 215 

Asn Ala Thr Tyr Leu Ser Ala Asn Pro val Pro Ser Ala Glu Ala Met 
220 225 230 235 

Ala 



<210> 42 

<211> 1130 

<212> PRT 

<213> Alicyclobacillus sp. 

<220> 

<221> SIGNAL 

<222> CD . . (24) 

<220> 

<221> mat_peptide 

<222> C25). .(1130) 

<223> functional polypeptide 

<400> 42 

Met Lys Arg Thr Leu ser Gly lie Ala Ser Ala Ala lie Val Leu Gly 
-20 -15 -10 

Ala lie Ser Pro Met Ala Phe Ala Gin Thr Ser Ser Ser Gly Leu Thr 
-5 -11 5 

Pro Ala Gly Gin Leu Pro lie val val Asn Gly Gin Val Leu Ser Asn 
10 15 20 

Pro Tyr Glu Met Val Gly Met Asp Ser Gly Asn Lys Thr Gly Phe Phe 
25 30 35 40 

Pro lie Tyr Tyr Phe Asp Gin Ala Leu Glu Lys lie Gly lie Thr Ala 
45 50 55 

Thr Trp Asn Gly Ala Thr His Thr Trp Ala Leu Thr Asp Ser Asn val 
60 65 70 

Asn Ala Ser Asn Val Gin val Ala Gly Gly Met Gly Thr Gly Asn Thr 
75 80 85 

Thr val Thr Leu Asn Gly Thr Pro lie Lys Met Phe Tyr Thr Gin val 
90 95 100 

Ala Lys Asp Pro Ala Gly Gly Pro Val Thr Thr Tyr Met Pro lie Tyr 
105 110 115 120 
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Tyr lie Asn Asn lie Leu Ser Ala Leu Gly lie His Gly Thr Phe Ser 
125 130 135 

Gly Gin Thr Gly Leu Asn lie Thr Thr Gly Gin Thr Leu Ala Gly Ser 
140 145 150 

Leu Ser Ala lie Thr Val Thr Gly Ala Thr Ser Gly Thr Gly Thr Ser 
155 160 165 

Ser ser Pro Ala Val Ala Leu Asn Asn Gly Lys Val Thr Leu Ser Thr 
170 175 180 

Thr Leu Thr Asp ser Asn Gly Asn Pro lie Gly Asn Ala Ala Val Thr 
185 190 195 200 

Phe Asn Phe ser Glu Tyr Gly Ala Leu Pro Ser Asn Ala Pro Thr val 
205 210 215 

Thr Asn Ala Ser Gly Ala Thr lie Pro Ala Thr Thr Gly ser Thr Ala 
220 225 230 

Tyr Gin Tyr Thr val Tyr Thr Asn Ser ser Gly val Ala Ser lie Thr 
235 240 245 

val ser Gly Pro val Gly Leu Thr Tyr Ala Tyr Gin Val Thr Ala Thr 
250 255 260 

Ala Pro lie Ser Asn Gly Ser Asn Gin Met lie Ser ser Gin Pro Ala 
265 270 275 280 

Tyr val Glu Phe val Ala Asn Asn Gin Ala Gly lie Ala Pro Tyr Gly 
285 290 295 

Thr Ala Ser Gin Pro Tyr Ser Ala Ser Leu Gly Thr Ala val Pro lie 
300 305 310 

Thr val lie Leu Pro Pro Gly Ala Asn Gly Gin Pro Gin Ala Asn Val 
315 320 325 

Leu val Thr Leu Ser Leu Ser Asn Pro Asn Gly Gly Thr Asn Tyr Ala 
330 335 340 

Tyr Phe Thr Asn Ser Ser Gly Ala Asn Leu Gly Thr Gin lie Gin val 
345 350 355 360 

Thr Thr Asn Ser Ser Gly Val Ala Gin Ala Trp Val Ser Asp Ala Asn 
365 370 375 

Ala Gin pro Val Val Val Thr Ala Asn val Ser Asn Ala Thr Asn Val 
380 385 390 
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Ser Asn Thr Ser val Ser Thr Tyr Leu Asn Phe Gly Gin Ala Gly Val 
395 400 405 

Pro Ala Ser lie Ala Asn Tyr Asn Asp Pro Tyr Ser Ala Leu val Ala 
410 415 420 

Asn Gly Gin Gin pro Leu Ala Gly Thr Thr val Thr lie Thr Gly Thr 
425 430 435 440 

Leu val Asp Ala Ala Gly Asn Pro val Ala Asn Gly Gin Val Leu val 
445 450 455 

Thr Gly Ser ser ser Ser Gly Asp Phe Gly Tyr val Thr Thr ser Asn 
460 465 470 

Gly Lys Ser Thr Thr Thr Asp Phe Pro ser val Gly Thr Leu Gin Pro 
475 480 485 

Gly Gin Pro val ser ser Ala Leu Gly Asp val lie Thr Ala Asp Ala 
490 495 500 

Asn Gly Asn Phe Ser Leu Gin Val Thr Asp Thr Gin Asn Glu Gin Ala 
505 510 515 520 

Ser Leu Thr Phe Tyr Ser val Ser Asn Gly Val lie Ser Pro Val Gly 
525 530 535 

val lie Lys Thr Asp Thr Leu Lys Phe Ala val Asn Asn Gin Leu Ser 
540 545 550 

Thr lie Ala Leu Gly Ala Thr Asp Ala Gin Ala Asp Gly Asn Gin Tyr 
555 560 565 

Thr Asn Leu Thr Gly Leu Thr Gly Ser Asp Asn Ala Pro Val Pro Val 
570 575 580 

Tyr val Asp Pro Gin Asn Pro Ser Gly Thr Met val Thr Asn Gin ser 
585 590 595 600 

lie Thr Tyr Thr Leu Ser Val Ser Ser Gly Asp lie Val Gly lie Gly 
605 610 615 

Ser Gly Ala Tyr Leu Ala Pro Thr Asn Ala Asn Asn ser Thr lie Pro 
620 625 630 

lie Asn ser Gly Asn Gly Leu Ser Ser Val Gin Val Thr val Thr Ala 
635 640 645 

Leu Gly Asn Asn Gin Tyr Gin lie Ser val Pro Gly Gin Gin Gly val 
650 655 660 
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Leu Thr Thr Ser Ser Pro Asp Phe Thr val Leu val uys Gly Ser Thr 
665 670 675 680 

Gly Ser Thr Lys Leu Thr val ser ser Gly Ser Leu Ser ser Thr Ala 
685 690 695 

Thr lie Thr Phe Thr ser ser Asn Pro Thr val val Ala Ser Leu Thr 
700 705 710 

Pro val ser ser val Leu Ala Ala Gly Gin Asn Glu Thr val Thr Phe 
715 720 725 

thr val Glu Asp Ala Asp Gly Asn Pro Val Ser Gly Asn Thr Gin Val 
730 735 740 

Ala lie Thr Ala His Asp ser Asn Asp Pro Leu Trp lie Thr Ala val 
745 750 755 760 

Asn Gly Thr Asn Leu Ser Glu Tyr Glu Thr lie Asn Gly Ala Ala Thr 
765 770 775 

Ser val Ser Thr Pro lie Pro Leu Gly Thr Ser Ser Tyr Ala Thr Ser 
780 785 790 

Gly Gly Ser Thr Leu Tyr pro Ala Tyr Thr Asn Ser Gly Tyr Phe Lys 
795 800 805 

Asn Gly Val Ser lie Ser Gly val val ser Trp Asp Gly Thr val Gly 
810 815 820 

Asp Pro lie Tyr Val Thr Thr Asn Ser Gin Gly Gin Val Thr Leu Thr 
825 830 835 840 

Leu Gin Asn Gly Asn Val Thr Tyr Phe Asp Gly Asn Asn Thr Thr Leu 
845 850 ~ 855 

Ser Asn Gly He Ser val Ala Gly Thr ser Gly Ser Glu Gly Phe Tyr 
860 865 870 

Thr Tyr Ser ser Asp Thr Ala Ala Thr Ala Ser Asp Leu Thr Asn Met 
875 880 885 

Gly val Leu Val lie Gly Gin Ala Asn Gly Asp Ala ser Thr Ser Leu 
890 895 900 

Gly Thr lie Tyr lie Gly ser Gly Gly Ala Thr Gin Thr Pro Ala Ala 
905 910 915 920 

Phe Thr Tyr Val Asp Ala Asn Asn His ser Tyr Thr Tyr Ser Asn Thr 
925 930 935 
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Ser Asp Thr Phe Thr Val Ser ser Thr Gin Ser val Ser Gly Gly Asn 
940 945 950 

Tyr Ala He Thr ser Phe Thr Pro val Gly Gly Thr Ala Thr ser Thr 
955 960 965 

lie Pro Ser Gly val Ser val Asn Ser ser Thr Gly Thr val Ser val 
970 975 980 

Ser Gin Asn Ala Ala val Gly Thr Tyr Thr val Ser Tyr Tyr Leu Asn 
985 990 995 1000 

Gly val Thr Glu Ser Thr Gly Thr Phe Lys val Tyr ser Gly Ser 
1005 1010 1015 

Gly val Ala Pro Thr Glu lie Thr Gly Ser Ser Val Thr val Pro 
1020 1025 1030 

Ala Ala Thr Tyr Ser Gly Thr Leu Lys Val Thr Val Ser Asn Gly 
1035 1040 1045 

Gly Ser Pro Leu Tyr val Asn val Thr Ala Gly Glu Ser Ala Asn 
1050 1055 1060 

Ala val Ala Ala Ala lie Tyr Asn Ala Leu Val Asn Ala Asn lie 
1065 1070 1075 

ser Gly Asp Thr phe Ser val Ser Gly Ser Thr val Ser val Thr 
1080 1085 1090 

Ala Ala Ser Gly Ser Pro Thr Leu Thr val val Asp Ala Thr Asn 
1095 1100 1105 

Phe 



<210> 43 

<211> 248 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> (1). .(41) 

<220> 

<221> mat__peptide 

<222> (42). .(248) 

<223> functional polypeptide 

<400> 43 

Met Arg lie Met Lys val Leu Gly Trp He Leu val Pro Tyr lie Met 
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-40 -35 -30 

Leu Phe He Gin Trp Gly Arg Met Asn Arg He Leu Arg Phe Ala Gly 
-25 -20 -15 -10 

Ser Leu Trp Ala Leu lie val Phe Ala Asn Thr val Tyr Met lie Arg 
-5 -11 5 

Gly Asn Thr Pro Arg Asn Ala Ser Thr val Ser Ala Thr Thr Ser Leu 
10 15 20 

val Asn ser Thr Asn Ser ser Gin val Ala Lys Gin Glu Gin Asn Ser 
25 30 35 

Ser Thr ser Pro Ala His Lys Ser Thr Asn Ser Leu Gin His Ala Gin 
40 45 50 55 

His Gin Ala Ala Thr Thr ser Ser Ser Gin Ser Lys Leu Arg Tyr lie 
60 65 70 

pro Phe His Thr Tyr Gly Lys val Gly Asp Leu Glu lie Arg val Asn 
75 80 85 

Ser Leu Gin Gin Val Lys Ser val Gly Tyr Asp Gly lie Gly Glu Thr 
90 95 100 

Ala Asn Gly Ala Phe Trp val lie Asn lie Thr lie Arg Asn Asp Gly 
105 110 115 

ser Thr pro Met Glu val val Asp Gly lie Phe His Leu Gin Asn Leu 
120 125 130 135 

Asn Gly Asn val Tyr Gin Pro Asp Ser Thr Ala Glu He Tyr Ala Asn 
140 145 150 

Thr Asn ser Gly Thr lie Pro Thr Asp Leu Asn Pro Gly val Ser Met 
155 160 165 

Thr Thr Asn Leu val Phe Asp Met Pro Asp Phe Met Thr Tyr Gly His 
170 175 180 

val Gly Gin His Tyr Ser Leu val Ala ser Met Gly Phe Phe Gly Ser 
185 190 195 

Asp Glu Thr Thr Tyr Ala Leu Pro 
200 205 

<210> 44 
<211> 172 
<212> PRT 

<213> Alicyclobacillus sp. 
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<220> 

<221> SIGNAL 
<222> CD.-C25J 

<220> 

<221> mat_peptide 

<222> (26).. (172) 

<223> functional polypeptide 

<400> 44 

Met Asn Arg Lys ser Met Leu ser val Leu Gly val Ala Ala Ala val 
-25 -20 -15 -10 

Ala Leu Met val Thr Gly Cys Gly Thr Ala Asn Ser Thr Asn Asn Thr 

-5 -11 5 

Ala Ser ser Gly Ala Ala Ser Thr Ala Val Thr Val Lys His Glu His 
10 15 20 

Lys Gly Ala Asn Ala ser Lys Thr Glu Thr Lys Gin Thr Glu Ala Lys 
25 30 35 

ser ser Asn Lys Ala Gly Glu Thr Ala Lys Ser ser val Lys Leu Thr 
40 45 50 55 

Ala Pro val Ala Gly Ala Thr val Thr Ala Gly Gly Thr Leu Lys val 
60 65 70 

ser Gly Gin val ser ser Asn Leu Ala Lys Lys Asp val Gin lie Thr 
75 80 85 

Leu Thr Asn Ser Ala Lys Lys Val Leu Val Gin Gin lie val Gly Thr 
90 95 100 

Asn Ser Thr Gly Ala Phe Val Asp Thr Leu Lys Leu Pro Lys Tyr Leu 
105 * 110 115 

Gly Lys Ala Gly Ser Asp Leu Thr Leu Ser val ser Val Val Gly Glu 
120 125 130 135 

Asn Gly val Val Ser Thr Leu Ser Leu His Val Lys 
140 145 

<210> 45 
<211> 242 
<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> (1). .(30) 

<220> 

<221> mat_peptide 
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<223> functional polypeptide 
<400> 45 



<222> (31). .(242) 
fi 



Met Arg Arg Ala val Arg lie Leu Ala Ala Leu Leu Phe Gly Leu Ala 
-30 -25 -20 -15 

Thr val Thr Ala Tbr Leu Met Phe val Pro Gin Ala Arg Ala Ala Thr 
-10 -5 -1 1 

val Thr Gly Ala Leu Ala Gin ser Gin val val Ser lie Thr Gly Gly 
5 10 15 

Tyr Asn Thr Thr Thr Gin Met Tyr Glu Gin Thr Gly Gin Gin Thr val 
20 25 30 

Val Thr Asn Trp Thr Phe Ser Leu Gin Gin Thr val Asn Gin Asn Asn 
35 40 45 50 

Glu Asn pro ser Tyr Ala Gin Cys Thr val Leu Ala Gly Asn Gin Gin 
55 60 65 

Val Thr Cys Thr Ser Asp Ala Thr Asn Asn Gly Ala lie Cys Thr Ser 
70 75 80 

Pro Tyr Pro Gly Ala lie Asp Lys Gin Cys Thr Asn Leu lie Gly Phe 
85 90 95 

Thr Gly Asn lie Ser Val ser Ser Gin Asn Gly Asn Pro Thr phe Thr 
100 105 110 

Phe Ser Leu Pro ser lie Asp Pro Ser Thr Met Lys Pro val Gly lie 
115 120 125 130 

Phe Val Thr Pro Glu Thr lie Tyr Gly Gin Met Gly Thr Gly ser Glu 
135 140 145 

Ser Tyr Leu ser ser Gly Gin Ser Gly Gly Trp Ser Phe Asn Phe ser 
150 155 160 

Asn val Ser Asp Pro Gin Asp Trp Tyr Phe Leu Leu Glu Phe Leu Ala 
165 170 175 

Asn Pro lie val Ala Ala lie Ala Val Pro Thr Thr Gin Thr Val Pro 
180 185 190 

lie Tyr ser Trp val Thr Thr Thr Val Trp His Pro Val Gin He Ser 
195 200 205 210 

Tyr Ser 
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<210> 46 

<211> 180 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> (1),.(24) 

<220> 

<221> mat_peptide 

<222> (25). .(180) 

<223> functional polypeptide 

<400> 46 

Val val Arg Met Arg Lys Arg Leu Gly Leu val Leu ser Met val Thr 
-20 -15 -10 

Ser val Leu Val Gly Cys Gly Ala ser His Pro Ser Pro Leu Asn Gin 
-5 -11 5 

Asp Lys Ser Leu Leu Thr Trp Asn Ala Ala Lys His Glu Val Arg Trp 
10 15 20 

Lys val val Ala Gly Asp Gly Arg Ala Asn Gly Gly Met Asn Phe Asp 
25 30 35 40 

Gly Tyr Ala Asn Gly Ser Met Thr Leu val val Pro lie Gly Trp Arg 
45 50 55 

val val lie Asp Phe Asp Asn Ala ser Leu Met Pro His Ser Ala Met 
60 65 70 

Val Val Pro Tyr Gly Asp Arg Glu Arg Ser Asn Phe Asp Ala Thr Met 
75 80 85 

val Ala Phe Pro Gly Ala Glu Thr Pro Asn Pro Ser Gin Gly Asp Pro 
90 95 100 

Gin Gly Thr His Arg Asp Val lie Phe Thr Ala Ala Lys Val Gly Thr 
105 110 115 120 

Tyr Ala Leu Val cys Gly val Pro Gly His Ala Leu Ala Gly Met Trp 
125 130 13 5 

Asp Gin Leu Val val ser Asp Glu Ala Lys His Pro Ser Leu Arg val 
140 145 150 

Gin Arg Asp Ser 
155 



<210> 47 
<211> 477 
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<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 
<222> (1). .(25) 

<220> 

<221> mat_peptide 

<222> (26)..C477) 

<223> functional polypeptide 

<400> 47 

Met Ala val Arq Arg Ala Trp Leu Leu Ala Pro Leu Cys Ala Ser Ser 
-25 -20 -15 -10 

Leu val val Pro Ala Ser Val Gin Ala Gly Leu Ala Gin Gly His Gly 
-5 -11 5 

Ser Phe Ser Thr val Arg Val Ser Val Gly Thr Ser Ser Ser Leu Ser 
10 15 20 

Val Pro Ala Leu lie Gin Gly Asn Glu Thr Tyr lie Pro Leu Trp Asp 
25 30 35 

Leu Met Gin val Leu His Gin Leu Gly Phe Thr Ala Thr Trp Ala Lys 
40 45 50 55 

Gly Gin Phe Ser val Ser Ala Pro Pro ser val Pro Met Asp Glu Ala 
60 65 70 

Pro Gly Pro Ala Gly Lys Gly Gly Ala Leu Val val Leu Asp Gly Gin 
75 80 85 

val val Glu Gin val Pro Thr Val lie Ala Thr pro Pro Gly Ala Ala 
90 95 100 

Thr Pro Glu val Phe Leu Pro Leu Thr Asn Ala Glu Glu lie Leu Gly 
105 110 115 

Arg Leu Gly lie Gin Ala Ser Ala Thr Gly Asn Gin val Asn Leu Asp 
120 125 130 135 

Ala Ser Ala val Pro Gin Ala Leu Pro Asn Gin Gin val Ala val Trp 
140 145 150 

Asn val Leu Ala Ala val Ala Ser Asp Leu Gly val Ser Thr Ala Pro 
155 160 " 165 

Ala Gly Pro ser Pro Tyr Ala Asp Leu Pro Thr Ala Ser Pro Ala Trp 
170 175 180 

Gly Ala val Glu Ala Ala lie Arg Leu Gly Trp Tyr Ser Pro Leu Ser 
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185 190 195 

Ala ser ser ser Gly Ala Phe Gin pro lie Thr Trp Ala Gin Thr Ala 
200 205 210 215 

Ser lie Leu Trp Asn Ala Leu Gly lie ser Gin Gin Asp Ala Ala Tyr 
220 225 230 

Gin Pro Gly Gly Ser Pro Thr Ala Trp Ala Ser Ala Leu Gly Leu Val 
235 240 245 

Pro Glu Asn Trp Asp Pro Ala Ser Tyr Met Thr Ala Gin Glu Leu Asp 
250 255 260 

Thr Leu Ala Ser Asn Leu His Glu Cys Leu Gin Gly Asp val Glu Thr 
265 270 275 

Gly Ala Asn Thr Trp Arg Leu Trp Tyr Pro Pro Ala Asp Glu val Glu 
280 285 290 295 

Ala Thr Leu Gin Ser Gly Gly Gly Gin ser Leu Phe Thr Ser Thr Ala 
300 305 310 

Asp Ala Gin Ala Ala lie ser ser Ala Tyr Gin Phe Phe Asn Gin Leu 
315 320 325 

Val Val Thr Arg val Gly Gin Gly Tyr val val Thr Val Pro Ser Val 
330 335 340 

Pro Glu Gly Tyr Gly Phe Ala Thr Phe Ser Ala Leu Gly Gly val Ala 
345 350 355 

Tyr Gin Thr Thr Pro Gly Gly Pro Trp Thr val Val Pro Val Leu Asp 
360 365 370 375 

Thr Arg Asp Val Ser lie Pro Ala Lys Gly Arg Leu Ser Val Lys val 
380 385 390 

Pro Ala Gin Gly lie Thr lie Thr Trp Asn Gin Met Met Pro Ser Leu 
395 400 405 

Gly Gly Thr val Ala Met Gly Ala Leu Gin Val Ser Pro Gly Pro Ser 
410 415 420 

Gly Pro ser val Glu Arg Leu Asn lie val Thr Pro Asn Leu Pro Pro 
425 430 435 

Val Leu Pro Ser Ser val Thr Ser Thr Gin Pro Gin Ser 
440 445 450 

<210> 48 
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<211> 340 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CD . . (19) 

<220> 

<221> mat_peptide 

<222> (20) . . (340) 

<223> functional polypeptide 

<400> 48 

Met Asn Arg Gin Trp Arg Leu Ala Val Ala Thr Ser Ala val Ala Ala 
-15 -10 -5 

ser Leu Ala Gly Cys Gly Ala Pro Asp Leu Ala Ala Met Arg Pro Thr 
-11 5 10 

val Gin Lys Ser Ala val Leu val Glu val val Gly Ala Pro Pro Phe 
15 20 25 

Ala Pro Ser Ala ser Gin Leu Gly Thr Ala Gly Ala Thr Ser val Glu 
30 35 40 45 

Val val His Val Ala Leu Gly Glu Trp Gin ser Val Ala Ala His Ala 
50 55 60 

Leu Ala Lys Gly Gin Leu Thr Gly Val Met val val cys Asp Asp Ala 
65 70 75 

Asn Ala val Ala ser Gly Leu Asn Gin Leu Ala Ala Asp His Pro Asp 
80 85 90 

Val Arg Phe Leu val Val ser Asn Trp Pro Ala ser Gin lie Thr Ser 
95 100 105 

Gly Asn val Glu Asp val Ala Gin Asp Pro val Ala val Ala Tyr Ser 
110 115 120 125 

lie Gly Ala Leu Cys Gly Asp Trp lie Ala Ser ser Thr Ser Thr ser 
130 135 140 

Gly Ala Val Tyr ser Gly val Pro ser lie Val Tyr Ala Pro Arg Gly 
145 150 155 

Ala Thr val Ala Glu Gin Lys Ala Phe Phe Thr Gly Leu Tyr Gin Ala 
160 165 170 

Asn Pro Asn val Arg Val Val Ala Leu Pro Gin Pro Ala Ala Gin Ser 
175 ~ 180 185 
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Leu Ser ser Tyr Gly Tyr Ala Val Asp Leu Gly Val Val Gly Gly Ser 
190 195 200 205 

Pro Ala Ala Gly Glu Leu Ser Ala Leu Arg ser Ala Ala Pro Ala Trp 
210 215 220 

Ala Ala Phe Gly Thr ser Pro lie Ala Gly Phe Ala lie ser Pro Gly 
225 230 235 

His Leu Ser ser ser Glu Ala val Gin Ala Phe Gin Ala Leu val Ser 
240 245 250 

Pro Asp Ala Trp His Ser Gly Glu His Leu val Leu Asp Leu ser ser 
255 260 265 

val Ala Phe Asp Asp Lys Gin val Pro Ala Thr val lie Ala Ala Trp 
270 275 280 285 

Ala Lys Leu Glu val Asn Ala lie Ala Ala Ala Ala Gin ser Asn Ala 
290 295 300 

Ala Phe Ala Ser Leu Pro Pro Ser Val Arg Ser Asp Leu Ala Asn Ala 
305 310 315 

Phe His Leu Ser 
320 

<210> 49 

<211> 341 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> (1)..(29) 

<220> 

<221> mat_peptide 

<222> C30). .(341) 

<223> functional polypeptide 

<400> 49 

Met Val Met Arg Thr Arg Trp lie Arg Trp Met Ala Leu Ala Leu Ala 
-25 -20 -15 

Val Cys Val Trp Leu ser Pro Phe Pro Phe Ser Trp Gly Ala Thr Ser 
-10 -5 -1 1 

Leu Asp Ala Asp Leu Pro Gin Pro Thr lie Pro Pro Ser Ala Trp Ser 
5 10 15 

Asn Leu Asn Gin Asp Trp Lys Asp Leu Gin Arg Leu Ala Gin Asn Thr 
20 25 30 35 
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val Pro Pro ser Lys Glu ser ser Gin Thr His Ala Pro Thr His Lys 
40 45 50 

Ser ser Gin Pro Pro Ala Gin val Pro Gin Gly Pro Leu val Gly val 
55 60 65 

Gly Asp Thr Gly Glu Ala Ala Arg Trp Leu Asn Glu Ala Leu Ala Val 
70 75 80 

Leu Gly Tyr Leu Pro Ala Val Phe ser pro Ala Ala Gin Thr Ser Thr 
85 90 95 

Arg Gin val Arg Leu Ala Leu Ala Ala Ser Ala Glu His Gin Thr Leu 
100 105 110 115 

val Pro lie Pro Gly Ser Phe Gin Leu Leu Tyr His Ala Pro Ser Ser 
120 125 130 

Trp val Ala Leu Trp Ser Ala Asp Glu Asp Thr Pro lie Thr Glu Gly 
135 140 145 

Ala val Met Ala Phe Glu Ala Gin His His Leu Gly Val Asp Gly lie 
150 155 160 

Ala Gly Pro Asp Val He His Ala Leu Ala Gin Ala Leu Ala Gly Asn 
165 170 175 

Glu Thr Ala Glu Lys Ala Pro Tyr ser Tyr He Leu Val Thr Thr Ser 
180 185 190 195 

Leu Pro Glu Thr Leu Glu Leu Trp val Asn Gly Gin Leu val Leu Lys 
200 205 210 

Ser Leu Cys Asn Thr Gly lie Ala Gin ser Pro Thr Pro Tyr Gly Thr 
215 220 225 

Tyr Gly val Tyr val Gin Tyr Thr ser Gin Glu Met Lys Gly Lys Asp 
230 235 240 

Pro Asp Gly Thr Pro Tyr Asp Asp Pro Gly Val Pro Trp val Ser Tyr 
245 250 255 

Phe Tyr Lys Gly cys Ala val His Gly Phe Leu Arg Ala Lys Tyr Gly 
260 265 270 275 

Phe Pro Gin Ser Leu Gly Cys val Glu Leu Pro Tyr Ala Ala Ala Lys 
280 285 290 

Thr val Phe Ser Tyr Thr His lie Gly Thr Leu val Thr val Thr Ala 
295 300 305 
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Ser Pro Leu Ser Ala 
310 

<210> 50 

<211> 399 

<212> PRT 

<213> Alicyclobacillus sp. 
<220> 

<221> SIGNAL 

<222> CD . - (28) 

<220> 

<221> mat_peptide 

<222> (30).. (399) 

<223> functional polypeptide 

<400> 50 

Met Asp Arg Leu Leu Asn Asn Lys val Ala Leu Arg Leu Thr Ala Leu 
-25 -20 ~ -15 

val Leu Ala cys lie Leu Trp Leu Ala val His Ala Glu Gin Gly Ser 
-10 -5 -11 

Gly Ser ser Ala Ser Thr Gly Val Thr Glu Ser Phe Glu Leu Pro Val 
5 10 15 

Arg val Glu Thr ser Ala Asp Glu Val Leu val ser Gin val Pro Thr 
20 25 30 35 

He Thr Ala Arg val Thr Thr Asn Leu Leu ser Leu Pro Thr Leu Ala 
40 45 50 

Ser Asp Met Met Lys Ala Glu lie val Ala Asp Ala Glu Asn Leu Gly 
55 60 65 

Pro Gly Thr Tyr Thr Leu His Val Ala Ala val Asn Met Pro Ala Gly 
70 75 80 

Val Arg Ser Tyr Thr Leu Thr Pro Ser Thr lie Thr Val Thr Leu Glu 
85 90 95 

Pro Lys Val Thr Val Glu Arg Thr val Arg val Asn val val Gly Thr 
100 105 110 115 

Pro Gly Gin Gly Tyr Val Leu Gly Lys Pro Glu Leu Gly Ala Gly val 
120 125 130 

val Glu val Ser Gly Ala Glu Ser ser val Gin Ala val Ala Glu val 
135 140 145 

Ala Gly val val Asp Ala ser Gly Leu Ser Gin Thr Ala Thr Lys Leu 
150 155 160 
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val Glu Leu Leu pro Leu Asp Gin Ala Gly Lys Ala val Pro Gly Val 
165 170 175 

Thr val Thr Pro Ser Ala lie Ser Val Thr Leu Pro lie Thr Ser Ala 
180 185 190 195 

asii Gin Ala val Lys Leu Thr Pro Ala val Thr Gly Ser Pro Ala Pro 
200 205 210 

Gly Tyr Ala Val Ala Ser Val His Leu Glu Pro Ala Ser Ala val Glu 
215 220 225 

Gin Gly Leu Ala Ala Ser Gin Leu Pro Gin Arg Gly Leu Leu val Pro 
230 235 240 

He Asp Val Thr Gly Leu Asn Arg Pro Thr Thr Val Ser val Pro Val 
245 250 255 

Pro Leu Leu Pro Gly Met Thr Ser val Ser Pro Thr Ala val Thr Ala 
260 265 270 275 

val lie Asp val Glu Pro Ser Ala val Tyr Thr val Ser Asn val pro 
280 285 290 

val Ala lie Thr Gly Ala Thr Gly val Lys Leu val Thr pro Arg Thr 
295 300 305 

val Asn val Thr val Thr Gly lie Glu Ala Asp val Arg Ala val Glu 
310 315 320 

Arg Asp pro Ala Ala val Gin Ala Phe val Asp Ala Thr Gly Leu Thr 
325 330 335 

His Gly Ser Ala Thr Leu Pro Asp Ser Asn Ser Ser Ala val Leu Ser 
340 345 350 355 

Leu val lie Arg pro Arg Glu Arg Arg Lys Arg Thr His Val Val 
360 " 365 " 370 

<210> 51 
<211> 34 
<212> DNA 

<213> Primer sigA2NotU-P 
<400> 51 

tcgcgatccg ttttcgcatt tatcgtgaaa cgct 

<210> 52 
<211> 33 
<212> DNA 

<213> Primer SigA2NotD-P 
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<400> 52 

ccgcaaacgc tggtgaaagt aaaagatgct gaa 



33 



<210> 53 

<211> 20 

<212> DNA 

<213> Primer A2up 

<400> 53 

agcgtttgcg gccgcgatcc 20 



<210> 54 

<211> 21 

<212> DNA 

<213> Primer B 

<400> 54 

ttattcggtc gaaaaggatc c 21 



<210> 55 

<211> 282 

<212> PRT 

<213> Aspergillus niger 



<220> 

<221> SIGNAL 

<222> CD.. (18) 

<220> 

<221> PROPER 

<222> (19). .(59) 

<220> 

<221> CHAIN 

<222> (60) . . (98) 

<220> 

<221> PROPEP 

<222> C99)..(109) 

<220> 

<221> CHAIN 

<222> (110).. (282) 

<220> 

<221> MOD_RES 

<222> (110).. (110) 

<220> 

<221> DISULFID 

<222> (115).. (139) 

<220> 

<221> DISULFID 

<222> (127). .(210) 

<400> 55 

Met Lys Phe Ser Thr He Leu Thr Gly Ser Leu Phe Ala Thr Ala Ala 
15 10 15 



Leu Ala Ala Pro Leu Thr Glu Lys Arg Arg Ala Arg Lys Glu Ala Arg 
20 25 30 
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Ala Ala Gly Lys Arg His ser Asn Pro Pro Tyr lie Pro Gly Ser Asp 
35 40 45 

Lys Glu lie Leu Lys Leu Asn Gly Thr Thr Asn Glu Glu Tyr Ser ser 
50 55 60 

Asn Trp Ala Gly Ala val Leu lie Gly Asp Gly Tyr Thr Lys val Thr 
65 70 75 80 

Gly Glu Phe Thr val Pro Ser val Ser Ala Gly Ser Ser Gly Ser Ser 
85 90 95 

Gly Tyr Gly Gly Gly Tyr Gly Tyr Trp Lys Asn Lys Arg Gin Ser Glu 
100 105 110 

Glu Tyr cys Ala Ser Ala Trp val Gly lie Asp Gly Asp Thr cys Glu 
115 120 125 

Thr Ala lie Leu Gin Thr Gly Val Asp Phe Cys Tyr Glu Asp Gly Gin 
130 13 5 140 

Thr Ser Tyr Asp Ala Trp Tyr Glu Trp Tyr Pro Asp Tyr Ala Tyr Asp 
145 150 15 5 160 

Phe Ser Asp lie Thr lie ser Glu Gly Asp Ser lie Lys val Thr val 
165 170 175 

Glu Ala Thr ser Lys Ser Ser Gly Ser Ala Thr val Glu Asn Leu Thr 
180 185 190 

Thr Gly Gin ser val Thr His Thr Phe Ser Gly Asn val Glu Gly Asp 
195 200 205 

Leu Cys Glu Thr Asn Ala Glu Trp lie Val Glu Asp Phe Glu Ser Gly 
210 215 220 

Asp ser Leu val Ala Phe Ala Asp Phe Gly Ser val Thr Phe Thr Asn 
225 230 235 240 

Ala Glu Ala Thr Ser Gly Gly Ser Thr Val Gly Pro Ser Asp Ala Thr 
245 250 255 

val Met Asp lie Glu Gin Asp Gly Ser val Leu Thr Glu Thr Ser val 
260 265 270 

Ser Gly Asp ser Val Thr Val Thr Tyr val 
275 280 
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